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I INTRODUCTION  AND  SUMMARY 


In  recent  years  there  have  been  several  successful  efforts  to  ex- 
tract from  ground-based  telescopic  images  some  of  the  information  that 
atmospheiic  seeing  (degradation  of  the  image  by  atmospheric  inhomo- 
geneities) destroys  in  normal  astronomical  photography.  The  best-known 
technique  of  information-recovery  is  the  "speckle  interferometry"  of 
Labeyrie  (1970).  A survey  of  such  techniques  was  recently  published  by 
Code  (1973).  All  the  techniques  that  have  so  far  been  developed  have 
modest  aims.  They  can  at  best  recover  only  a small  fraction  of  the  lost 
information.  On  the  other  hand,  Babcock  (1953)  proposed  twenty  years 
ago  a system  aimed  at  the  total  restoration  of  images.  Babcock  proposed 
to  build  a telescope  with  an  actively  mobile  optical  system,  capable  of 
compensating  completely  the  effects  of  atmospheric  seeing  in  real  time. 

I do  not  know  whether  Babcock’s  ideas  received  any  serious  attention  from 
telescope  designers  in  the  1950's.  It  would  have  been  very  difficult  to 
implement  Babcock's  scheme  twenty  years  ago,  because  we  lacked  the  tech- 
nology of  on-I >ne  computers.  Since  on-line  computers  and  efficient  ima?e- 
monitoring  devices  have  now  become  available,  the  time  is  ripe  for  a seri- 
ous study  of  the  feasibility  of  Babcock's  proposal.  The  purpose  of  the 
pic  sent  series  of  papers  is  to  provide  a theoretical  basis  for  such  a 
study.  My  collaborator,  Dr.  Richard  Muller  (1974),  is  simultaneously 
carrying  out  computer  simulations  and  designing  optical  equipment  that 
will  constitute  the  experimental  part  of  the  study. 
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Dui xng  the  last  few  years  several  groups,  apparently  unaware  of 
Babcock's  paper,  have  independently  made  proposals  of  active  optical 
systems  that  are  in  essence  modernized  versions  of  Babcock's  system. 

I am  grateful  to  Dr.  D.  L.  Fried  of  Optical  Science  Consultants,  and  to 
Dr.  R.  S.  Leonard  of  Stanford  Research  Institute,  for  information  about 
two  such  proposals.  I will  not  discuss  these  proposals  in  detail  since 
they  involve  company  proprietary  information.  1 only  wish  to  acknowledge 
that  they  provided  the  initial  stimulus  that  led  me  to  develop  the  theory 
presented  in  these  papers. 

The  image -improvement  concept  of  Babcock  involves  fcr.r  essential 
components:  (1)  A mirror  or  optical  correct ing-plate , composed  of  ele- 

ments that  can  be  moved  rapidly  and  independently  in  a direction  normal 
to  the  surface,  (2)  an  image-detector  or  television  camera  system  that 
records  instantaneously  the  positions  and  times  of  arrival  of  photons 
as  they  reach  the  image  or  images,  (3)  a data-processing  system  with  an 
on-line  computer  that  accepts  data  from  the  image-detectors  and  calculates 
the  desired  response  of  the  various  elements  of  the  mirror,  and  (4)  a 
servosystem  that  moves  the  mirror  elements  as  directed  by  the  computei . 

Of  these  four  components,  three  (1,  2,  and  4)  are  hardware  items  whose 
existence  I shall  take  for  granted.  Their  implementation  is  a problem 
for  the  experimenter  rather  than  the  theorist.  Since  the  amplitude  of 
motion  required  of  the  mirror  elements  is  only  a few  optical  wavelengths 
(of  the  order  of  10  4 cm),  and  the  atmospheric  fluctuations  to  which  they 
are  required  to  respond  have  frequencies  in  the  range  from  1 to  1000  Hz, 
the  required  velocities  are  only  of  the  order  of  millimeters  pei  second 
and  should  be  easily  attainable  by  means  of  piezoelectric  crystals  or 
other  conventional  transducers.  In  these  papers  I shall  be  primarily 
concerned  with  Item  3,  the  software  item.  The  essential  question  that 
has  to  be  answered  is  the  following:  What  kind  of  program  should  be  given 

to  the  computer,  to  convert  information  arriving  at  the  image  into 
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instructions  telling  the  mirror  what  to  do  to  improve  the  quality  of  the 
image9  It  is  not  self-evident  that  any  program  exists  that  '/ill  enable 
the  image  to  lift  itself  up  by  its  bootstraps  in  this  fashion.  We  shall 
outline  the  construction  of  such  a program,  and  show  that  images  of  sub- 
stantially improved  resolution  can  in  principle  by  obtained  by  using  it. 
How  great  the  improvement  is,  and  how  wide  the  class  of  astronomical 
object  that  is  accessible  to  this  technique,  can  only  be  determined  by 
operational  trials. 

The  compensation  of  atmospheric  seeing  is  possible  only  under  two 
severe  limitations.  First,  the  highly-resolved  optical  field  must  be 
very  narrow,  at  best  like  a tiny  fovea  in  the  visual  field  of  the  human 
eye.  The  reason  for  the  narrowness  of  the  field  is  that  a single  con- 
figuration of  the  mirror  surface  is  supposed  to  compensate  for  the  dis- 
tortion of  the  wavefronts  of  all  photons  arriving  at  the  mirror.  This 
is  possible  only  if  all  photons  arriving  at  a given  point  on  the  mirror 
pass  through  the  same  atmospheric  inhomogeneitios.  Therefore  the  angular 
size  of  the  correctly-compensated  visual  field  is  equal  to  the  angular 
size  of  the  disturbing  patches  in  the  atmosphere  as  seen  from  the  tele- 
scope. Estimates  of  the  size  of  the  patches  range  from  seconds  to  minutes 
of  arc;  some  of  the  relevant  observational  data  are  summarized  by  Bufton 
and  Genatt  (1971),  The  wide  scatter  of  the  data  indicates  that  patch 
sizes  are  highly  variable  from  place  to  place  and  from  night  to  night. 

One  of  the  byproducts  of  an  operational  image-improvement  system  will  be 
accurate  information  about  the  scale  and  spectrum  of  atmospheric  inhomo- 
geneities. 

The  second  basic  limitation  on  the  compensation  of  seeing  is  the 
problem  of  photon  statistics.  There  must  be  enough  photons  arriving  at 
each  resolution  element  of  the  image  during  each  atmospheric  fluctuation 
period,  to  provide  reliable  information  about  the  momentary  distortion 
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of  the  wavefronts.  The  image-improvement  system  is  therefore  in  principle 
limited  not  only  to  narrow  fields  but  to  bright  objects.  TTiere  are  two 
independent  sources  of  noise  in  a telescopic  image,  atmospheric  noise  and 
photon  noise.  The  image- improvement  system  works  by  feedback  of  signals 
from  the  image  to  the  mirror.  The  feedback  achieves  a reduction  of 
atmospheric  noise  at  the  cost  of  an  amplification  of  photon  noise.  The 
optimum  image-improvement  occurs  when  the  two  sources  of  noise  become 
roughly  equal.  When  a bright  object  is  observed  in  a conventional  tele- 
scope, the  atmospheric  noise  is  enormously  greater  than  the  photon  noise, 
and  so  a large  improvement  can  in  principle  be  obtained  by  trading  off 
the  atmosphere  against  the  photons.  When  a very  faint  object  is  observed, 
the  two  sources  of  noise  are  comparable,  and  a tradeoff  offers  no  hope 
of  advantage.  Detailed  analysis  allows  us  to  replace  the  qualitative 
words  "bright"  and  "faint”  in  the  foregoing  statements  by  quantitative 
est  Jma  tes. 

There  are  two  distinct  types  of  image-improvement  systems — the 
single-image  type  and  the  multiple-image  type.  The  single-image  type  is 
like  a conventional  telescope,  with  all  the  light  going  to  a single  image 
and  pll  the  input  to  the  feedback  system  coming  from  a single  image 
monitor.  The  multiple- image  type  uses  an  array  of  secondary  mirrors  and 
prisms  to  split  the  light  from  the  primary  mirror  into  many  beams  that 
are  recombined  to  form  many  images.  Each  of  the  secondary  images  has  its 
own  detector,  which  acts  as  an  interferometer  and  feeds  back  information 
to  the  primary  mirror.  In  its  extreme  form,  the  multiple-image  system 
has  a number  of  images  equal  to  the  number  of  movable  elements  in  the 
primary  mirror.  The  extreme  multiple-image  system  makes  optimum  use  of 
all  the  information  contained  in  the  incident  light  and  gives  a final 
image  of  the  highest  possible  resolution.  This  optimum  performance  will 
be  very  costly,  requiring  large  numbers  of  precise  optical  and  mechanical 
linkages  in  addition  to  the  multiple  image-monitors  and  data  processors, 
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The  single-image  system  gives  a smaller  but  still  useful  image-improvement 
at  much  lower  cost.  Between  the  single-image  and  extreme  multiple-image 
systems  there  are  systems  of  intermediate  performance  and  intermediate 
cost.  Experience  will  decide  how  far  it  is  worthwhile  to  go  in  pursuit 
of  the  ultimate  performance.  The  initial  demonstration  model  of  an 
image- improvement  system  should  bo  a single-image  system  or  modest  aper- 
ture. When  a working  single- image  model  is  available,  the  next  step  will 
be  to  combine  several  copies  of  it  into  an  interferometer  array.  The 
array  can  then  lie  gradually  expanded  until  it  becomes  a complete  multiple- 
image  system. 

The  results  of  the  theory  can  be  summarized  in  a single  formula  for 
the  mean-square  phase  error  of  the  light  waves  arriving  at  the  image  of 
a single- image  system, 

-1/5 

12  2 

D = 0.5  (p  ~ q/d  v)  . (1.1) 

Hero  D is  measured  in  square  radians,  so  that  D < 1 is  a criterion  for  an 
acceptable  imago  with  dif f ract ion- limited  resolution,  d is  the  mirror 
diameter,  "V  is  the  maximum  angular  frequency  of  serious  atmospheric 
disturbances,  p is  the  diameter  of  a mirror  whose  diffraction  limit  is 
equal  to  the  resolution  limit  defined  by  the  uncorrected  atmospheric 
seeing,  is  the  cent rast incss"  of  the  image,  or  the  fraction  of  the 
light  intensity  involved  in  structure  inside  the  seeing  limit,  and  q is 
the  flux  of  detected  incident  photons.  These  quantities  are  more  pre- 
cisely defined  in  Eqs.  (1.8),  (5.37),  (5.10),  and  (10.11)  below.  The 
formula  (1.1)  is  to  be  understood  as  an  order-of-magnitude  estimate  only. 
On  the  right-hand  side  there  is  a numerical  coefficient  that  depends  on 
the  detailed  design  of  the  system.  To  calculate  the  coefficient  reliably 
would  require  a large-scale  computer  simulation  or,  even  better,  an 
operational  system. 
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The  movable  elements  of  the  mirror  are  chosen  to  have  diameter  P 
determined  by  the  seeing  conditions  for  which  the  system  is  designed. 

The  optimum  choice  of  P is  comparable  with  the  seeing-patch  diameter,  P. 
The  number  of  mirror  elements  contributing  to  the  image  is  then 


n = (d2/p2) 


(1.2) 


and  the  criterion  for  the  system  to  give  diffraction-limited  resolution 


-2 

l > nT]  o 


(1.3) 


3 = (V/p  ) 

O 


(1.4) 


The  flux  3^  depends  only  on  atmospheric  seeing  conditions  and  is  inde- 
pendent of  the  system  design.  As  a basis  for  rough  estimates  we  may  take 
-3-1 

p = 10  cm,  Y = 10  sec  , so  that 


3 = 10  photons  per  cm  per  sec 


(1.5) 


The  flux  (1.5)  would  correspond  to  an  object  of  visual  magnitude  14,  if 
photons  could  be  detected  with  100%  efficiency. 

For  a multiple-image  system  the  same  formula  (1.3)  applies,  where  n 
is  now  the  number  of  mirror  elements  contributing  to  each  image.  For 
the  extreme  multiple-image  system,  n = 2 independently  of  the  size  of 
the  system.  If  the  optical  field  contains  as  one  of  its  main  features 
an  object  with  sharply  defined  structure,  such  as  a star  or  a star-cluster 
or  the  edge  of  a planet,  we  can  take  7)  = 1,  and  Eq.  (1.3)  reduces  to 


3 > K 


(1.6) 
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II  the  image-detectors  have  a quantum  efficiency  of  10^,  then  Eq.  (1.6) 
implies  that  any  high-contrast  object  with  visual  magnitude 


<:  11 


(1.7) 


lution  in  an  extreme  multiple- 


can  be  imaged  with  arbitrarily  high  re  so 
image  system.  To  obtain  diffraction-limited  resolution  in  a single-image 

svstem  with  aperture  d meters,  we  need  a magnitude 


m <.  7 - 5 log  d 


(1.8) 


Single-image  systems  require  bright  objects.  But  the  numerical  estimates 
(1.7)  and  (1.8)  should  not  be  taken  too  seriously.  The  limiting  magni- 
tude that  an  image- improvement  system  can  handle  will  vary  stiongly  with 
the  seeing  conditions,  as  indicated  by  the  factor  p in  Eq.  (1.1).  At 
times  of  particularly  good  seeing,  the  limiting  magnitude  may  be  con- 
siderably fainter  than  Eq.  (1.7)  or  Eq.  (1.8).  The  true  limits  can  only 
be  discovered  by  experience  with  real  systems. 

The  plan  of  this  report  is  as  follows.  Section  II  and  Appendix  A 
provide  the  theoretical  basis  for  the  remainder  of  the  report.  Section 
II  contains  a general  prescription  for  optimizing  the  performance  of  any 
linear  servomechanism  whose  input  is  subject  to  quantum  fluctuations. 

An  exact  expression  is  obtained  for  the  fluctuations  of  the  output  of  the 
optimized  servo,  given  the  spectral  characteristics  of  the  input.  Sec- 
tions III  and  IV  contain  a precise  definition  of  a single-image  system 
and  a calculation  of  its  performance.  The  results  of  Sections  III  and 
IV  could  be  applied  to  the  optimization  of  the  design  of  an  actual  system 
by  numerical  simulation.  But  the  formulae  are  too  unwieldy  for  optimi- 
zation to  be  carried  through  in  closed  analytic  form.  Sections  V and  VI 
contain  a more  complete  analysis  of  an  idealized  single-image  system, 
with  approximations  that  need  not  be  made  in  a computer  simulation.  The 
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aim  of  these  sections  is  to  obtain  a detailed  understanding  of  the 
characteristics  of  an  optimized  system.  The  optimization  is  carried 
through  exactly,  and  the  parameters  of  the  optimum  program  for  feeding 
back  information  from  image  to  mirror  are  completely  specified,  A par- 
ticular choice  of  atmospheric  model  leads  to  the  result,  Eq.  (1.1).  In 
Sections  VII  through  X we  return  to  the  general  single-image  system  de- 
scribed in  Section  III,  and  optimize  it  using  a modified  version  of  the 
criterion  of  performance  defined  in  Section  IV.  We  do  not  make  the  "big- 
mirror  approximation"  on  which  the  explicit  optimizations  of  Section  V 
a ix;  based.  The  optimizations  of  Section  V are  carried  out  in  three  stages, 
fixing  - irst  the  y-dependence  of  the  feedback  kernel  N,  second  the  x-  and 
t-  dependence  of  N,  and  third  the  x-dependence  of  the  tremor-covariance 
function  F.  In  Section  VII  we  carry  through  the  first  two  stages  without 
approximation,  obtaining  analytic  expressions  for  the  optimum  N cor- 
responding to  any  given  F.  These  expressions  have  a simple  heuristic 
interpretation,  which  is  explained  ir  Section  VIII.  The  third  stage  of 
optimization,  leading  to  the  determination  of  the  optimum  F,  is  described 
in  Section  IX,  Exact  equations  defining  F implicitly  are  written  down. 

We  introduce  simplifying  approximations,  which  are  less  drastic  than  the 
big-mirror  approximation  of  Section  V,  and  still  allow  F to  be  calculated 
explicitly.  In  this  way  we  recover  the  results  of  Section  V,  under  con- 
ditions that  are  realistic  enough  to  be  applied  to  actual  telescopes.  In 
particular,  the  effects  of  the  mirror  edges  are  no  longer  required  to  be 
negligible.  The  results  of  Section  IX,  describing  the  characteristics 
and  performance  of  an  optimized  system,  include  as  special  cases  both 
the  idealized  big-mirror  model  of  Section  V and  the  rigid-mirror  pure- 
tilt  model  of  Section  VI.  Finally,  Section  X discusses  some  points  of 
detail  that  arise  in  applying  these  results  to  real  systems. 
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The  optimization  method  of  Sections  VII  through  IX  can  be  applied 
almost  without  change  to  multiple-image  systems,  and  leads  to  systems 
with  performance  defined  by  Eq.  (1.3)  or  Eq.  (l.b).  Multip  .e-image  sys- 
tems will  be  analyzed  in  detail  in  a subsequent  report. 

Many  questions  of  great  importance  in  practical  telescope  design 
are  not  discussed  in  this  report.  The  most  essential  question  left  for 
future  investigation  is  the  optimum  geometry  of  an  image-improvement 
system.  The  theory  of  Sections  III,  IV,  VII,  and  VIII  applies  to  a sys- 
tem of  arbitrary  geometry,  with  any  number  of  mirrors  in  an  array  of  any 
shape,  but  the  optimization  analysis  of  Sections  V,  VI,  IX,  and  X assumes 
a conventional  single-mirror  system.  It  seems  likely  that  in  optical 
astronomy  the  relative  merits  of  extended  arrays  and  single  dishes  will 
be  different  for  different  types  of  object,  as  is  the  case  in  radio 
astronomy  [for  a description  of  a multiple- mirror  optical  telescope, 
see  Weymann  and  Carlton  (1972)].  It  would  be  natural  to  use  a single- 
image improvement  system  with  a single-mirror  telescope,  or  a multiple- 
image  system  with  a multiple-mirror  telescope,  but  there  is  no  compelling 
reason  for  excluding  the  other  two  combinations.  The  combination  of 
multiple-mirror  with  multiple-image  system  would  allow  the  most  freedom 
in  the  choice  of  geometry.  Such  a system  with  a spaced-out  array  of 
mirrors  would  be  an  interferometer  of  very  high  capability.  The  applica- 
tion of  image-improvement  to  optical  long-base-line  interferometry  looks 
very  promising  and  deserves  a separate  analysis. 
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II  THEORY  OF  A QUANTUM-MECHANICAL  SERVOMECHANISM, 
WITH  APPLICATION  TO  A SIMPLE  INTERFEROMETEh 


We  define  in  abstract  mathematical  language  the  situation  that  is 
at  the  heart  of  an'  image- improvement  system.  We  have  a random  variable 
a(t),  not  directly  observable,  whose  statistical  properties  are  expressed 
by  the  expectation  values 

<a(t)>  = 0 , ( a(s)  a(t)>  = U(s  - t)  . (2.1) 

I 

We  have  a response  function  b(t),  which  we  are  free  to  program  using  a)1 
the  information  available  to  us  up  to  the  time  t.  In  the  applications, 
a ( t)  will  be  an  atmospheric  phase  shift  and  b(t)  a compensating  phase 
shift  applied  to  a mirror  element.  Our  information  comes  from  a sense 
organ  that  detects  photons.  The  probability  for  a photon  to  be  detected 
per  unit  time  at  time  t is 


Kt) 

= A + B s(t)  , 

(2.2) 

s(t) 

= a(t)  + b(t ) 

(2.3) 

The  photons  arrive  at  random  at  times  (t  , t , ...)  following  a Poisson 

X 2 

distribution  with  the  rate  given  by  Eo.  (2,2),  The  expectation  values 

(2.1)  and  the  coefficients  A,  B are  supposed  known.  Otherwise,  the 

totality  of  our  information  at  time  t is  the  series  of  times  oi  arrival 

of  photons  with  t < t. 

j 

We  suppose  that  the  program  defining  b(t)  is  linear  in  the  photon 
arrival  rate.  Then  the  most  general  program  is 


Preceding  page  blank 


b(t)  = Z N(t  - t ) - C 
j j 


(2.4) 


where  C is  a constant  and  N(r)  is  any  function  of  a real  variable  satis- 
fying 


N(t)  = 0 for  t < C 


(2.5) 


Our  objective  is  to  choose  the  program  (2.4)  so  that  b(t)  compensates  a(t) 
as  well  as  possible.  First,  we  require  zero  bias,  or 


<s(t)>  = <b(t))  = 0 


(2.6) 


which  fixes  the  constant  term  in  Eq.  (2.4), 


C = A I N(t)  dt 


(2.7) 


Second,  as  the  criterion  for  optimization  we  require  that  the  mean  fluc- 


tuation 


D = <[s(t)]  ) 


(2.8) 


be  as  small  as  possible.  The  averages  in  Eqs.  (2.6)  and  (2.8)  are  taken 
over  both  the  photon  statistics  and  the  random  variations  of  a(t)  (atmo- 
spheric statistics).  We  write  ( ) to  denote  an  average  over  photon 

P 

statistics  only  with  a(t)  fixed. 

We  define  Fourier  transforms  by 


U(-u)  = J U(t)  exp(ixt)  dt  , 


(2.9) 


and  similarly  for  N(tu) , a(iu),  s(uu)  and  so  on.  The  function  U(uj)  is  the 
spectral  function  of  the  random  variable  a(t);  it  is  real,  positive,  and 
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oven.  The  function  N(ju)  is  analytic  in  the  upper  half-plane.  The  re- 
sults of  our  analysis  are  summarized  in  the  following 

Theorem:  For  any  feedback  program  defined  by  Eqs.  (2.4)  and  (2.7). 

the  mean  fluctuation  is 


/ 


D (1/2-:)  I dx  IU(ju)  + A N (uj  ) 1 ] / [' 


1 - BN(X) 


2i! 


(2 . iO) 


The  optimum  program  has  N(t)  defined  by 

1 - BN(x)  = exp  [(1/2*1)  - U))_1  d,x'] 

G(w)  = log[l  + IL'Cr)]  , 

I = b2/a 


(2.11) 
(2. 12) 
(2.12) 


and  gives  the  minimum  fluctuation 
D = (1/2^1) 

If  an  upper  bound 


/ 


G(x)  dx  = - (A/B)  X ( 0+ ) 


M = Max 


U(x)] 


(2.14) 


(2.15) 


exists,  then  the  optimum  program  gives 


D < f Al/I  ] 


(2.16) 


The  proof  of  these  statements  is  given  in  Appendix  A.  We  continue 
here  with  a discussion  of  their  physical  meaning.  The  formula  (2.10) 
shows  the  total  system  noise  D as  a sum  of  two  terms,  the  atmospheric 
noise  proportional  to  U(x),  and  the  photon  noise  proportional  to  A,  both 
modified  by  the  feedback  factor  ]l  - BN(ui)  '2.  when  there  is  no  feedback 
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rN(x)  - 0] , D reduces  to  the  purely  atmospheric 


Na  = U(id)  du  = U(0) 


noi  so 


= aa(t)]  ) . 


(2.17) 


When  the  feedback  is  infinitely  strong  [Nfa)  _*  « for  w in  a finite  fre- 
quency band  of  width  rfy],  D reduces  to  the  pure  photon  noise 


Np  = (A/2  ’tB  ) J duj  = (y/2 1 ) 


(2.18) 


There  is  some  arbitrariness  in  the  definition  of  <y,  the  bandwidth  over 
which  strong  feedback  is  required.  For  definiteness  we  choose  to  define 
the  atmospheric  spectrum  amplitude  a and  width  Y by 


0 = u<°)  . Y = (M/2a  ) 


(2.19) 


with  M given  by  Eq.  (2.15).  Then  the  optimum  program  gives,  according 
to  Eq.  (2.16) , 


D<  [2Ya2/I]1/2 


which  may  also  be  written 


(2.20) 


D < 2 (N  N ) 
a p 


(2.21) 


The  optimum  program  thus  achieves  a tradeoff  between  atmospheric  and 
photon  noise  as  described  qualitatively  in  Section  I. 

The  motivation  for  the  definition  (2.19)  is  as  follows.  A standard 
example  of  a random  process  (not  necessarily  applicable  to  the  real  at- 
mosphere) has  the  autocorrelation  function 
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r 


U(t) 


a exp (-Y  t ) 


and  the  spectrum 


(2.22) 


~ 2 2 2 

U(x)  = f2ya  / (uu  + y )]  . (2.23) 

For  this  process,  Y defined  by  Eq.  (2,19)  coincides  with  the  decay- 
constant  Y in  U(t).  It  is  easy  to  compute  the  optimum  program  explicitly 
for  the  spectrum  (2.23).  We  find 


B N ( t)  = - (5  - Y)  exp(-Yt)  , t > 0 , (2.24) 

B N(i)  = - i (6  - Y)  ('JJ  + 1Y)”1  , Imr  > 0 , (2.25) 

2 2 1/2 

6 = (Y  + 2a  YI)  , (2.26) 


and  hence 


2 1/2  1/2  1/2 

D = ( (5  - Y)/I]  = (2Ya  /l)  [(1  + z)  - z ] , (2.27) 

z = (Y/2a2l)  = (N  /N  ) . (2.28) 

P a 

For  the  interesting  case  in  which  feedback  is  strong,  N » N , z is 

a p 

small,  and  the  estimate  (2.20)  of  the  optimum  system  noise  comes  very 
close  to  the  true  value,  Eq.  (2.27).  We  may  thus  expect  that  Eq.  (2.20) 
provides  a reasonably  quantitative  estimate  of  the  system  performance1, 
for  any  process  with  a spectrum  not  radically  different  in  shape  from 
Eq.  (2.23). 

If  the  atmospheric  fluctuation  spectrum  has  a sharper  cutoff  at  high 
frequencies,  so  that  the  long  tails  of  Eq.  (2.23)  are  absent,  then  the 
performance  is  far  better  than  the  estimate  (2.20)  would  indicate.  In 
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this  case  it  is  convenient  to  define  another  "effective  spectrum  width" 
by 

rfY  = y*(IU(tr)/[l  + IU(ju)]  da;  . (2.29) 

This  jtY  comes  closer  to  the  true  spectrum  width  of  U(uj)  as  the  cutofi  at. 
high  frequencies  becomes  sharper.  For  the  spectrum  (2.23),  y is  much 
larger  than  y,  and  so  Y is  an  inappropriate  quantity  to  describe  the 
spectrum  width.  On  the  basi  of  the  definition  (2,29)  we  can  prove  that 
an  optimum  program  gives 

D < N (1  - z)  1 log (z  1)  , (2.30) 

P 

z = (N  /N  ) , N = (Y/2I)  . (2.31) 

pa  p 

For  the  proof  see  Appendix  B.  The  result,  Eq.  (2.30),  shows  that  for  a 
spectrum  with  sharp  cutoff  the  atmospheric  fluctuations  can  be  compensated 
very  efficiently,  so  that  the  total  system  noise  with  the  optimum  program 
exceeds  the  pure  photon  noise  only  by  a logarithmic  factor. 

In  our  discussion  of  the  image-improvement  problem  we  shall  make 
use  of  the  pessimistic  estimate,  Eq.  (2.20)  or  Eq.  (2.27),  for  D and 
not  the  optimistic  estimate,  Eq.  (2.30).  In  calling  these  estimates 
pessimistic  and  optimistic,  we  do  not  imply  any  doubt  concerning  their 
validity.  Both  are  strictly  valid  in  all  cases.  However,  Eq.  (2.20)  is 
closer  to  equality  for  a long-tailed  spectrum,  while  Eq.  (2.30)  is  closer 
to  equality  for  a cutoff  spectrum.  The  long-tailed  spectrum  makes  the 
task  of  image-improvement  harder.  Therefore  the  use  of  Eq.  (2.20)  makes 
our  estimates  of  the  system  performance  conservative. 

For  completeness  we  add  here  two  more  estimates  for  the  optimum  D, 
each  of  which  may  be  convergent  for  spectra  of  shape  intermediate  between 
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long-tailed  and  cutoff.  The  inequality  (2.20)  holds  with  Y replaced  by 
either  Y(  or  ) , where  Y and  Yy  are  two  alternative  definitions  of  the 
spectrum  width, 


and 


Y = (c/1  ) 


jr 

iL'(x)  1 dxj  / 


, 1 /2 


1/ 


L'(x)  di 


Y = (c/2) 


ll/ox,  x d i j / |jl'(i  ) dx 


||!/2 


(2.32) 


(2.33) 


c = 0.6176  = Max  J x flog  (1  + x)] 
x I 


(2  3 1) 


The  proof  of  Eq.  (2.20)  with  these  definitions  is  given  in  Appendix  C. 

The  definition  (2.33)  is  (apart  from  the  numerical  coefficient)  the  most 

natural  definition  of  a spectral  width,  but  unfortunately  Y and  Y are 

1 2 

both  infinite  for  a long-tailed  spectrum  such  as  Eq , (2,23), 

To  conclude  this  section,  we  discuss  the  application  of  the  foregoing 
theory  to  the  problem  of  optical  phase-compensation  in  a simple  inter- 
ferometer. Suppose  we  have  an  interferometer  measuring  the  resultant  of 
two  coherent  light  signals  with  amplitudes  f,  g,  and  phase-difference 

s(t)  — (2.35) 

Here  s(t)  is  given  by  Eq.  (2.3),  a random  phase-difference  a(t)  produced 
by  the  atmosphere  plus  the  compensating  programmed  phase  difference  b(t). 
The  angle  may  be  chosen  to  make  the  interferometer  as  effective  as 
possible  in  keeping  s(t)  small.  The  output  of  the  interferometer  is  a 
photon  count  at  the  rate 

2 2 

I ( t ) = K v f + g + 2fg  cos  fs(t)  - "]  , (2.36) 
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where  R is  the  background  oj  incoherent  photons.  We  want  to  approximate 
this  nonlinear  response  by  the  linear  expression  (2.2)  with 


2 2 

A = R +•  f +•  g +-  2fg  cos  to  , 


(2.37) 


B = 2fg  sin  to 


(2.38) 


The  linear  approximation  will  be  good  if  the  system  succeeds  in  keeping 
s(t)  small.  The  replacement  of  sinfs(t)]  by  s(t)  requires  simply 


D « 1 


(2.39) 


The  replacement  of  cosfs(t)]  by  1 in  the  A term  requires 


fg  cos  to  D « A 


(2.40) 


Usually  Eq.  (2.39)  will  imply  Eq.  (2.40),  but  Eq.  (2.40)  may  be  violated 
even  when  Eq.  (2.39)  holds,  if  we  try  to  operate  the  interferometer  close 
to  zero  photon  rate,  with  R small,  f and  g equal,  and  to  close  to  n. 

According  to  Eq.  (2.14),  D depends  only  on  the  parameter  I = (B^/A) , 
and  D is  minimized  when  I is  maximized.  From  Eqs.  (2.37)  and  (2.38)  we 
find 


I - [(2fg  sin  to) /(x  + cos  to)]  . 


(2.41) 


X=[(h+f2+  g2)/(2fg)]  > 1 


(2.42) 


The  maximum  of  I is  reached  at  to  = to  w’herc 

o 


cos  to  = -£  , 


r - v 

*>  — X 


-4T^ 


(2.43) 


and  there  I takes  the  value 
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= 4fg  C 


(2.44) 


I 

Max 


On  the  other  hand,  at  cp  = n/2  the  value  of  I is 

i„  = 4teC/(i  + C2)  . 


(2-45) 


The  advantage  of  going  from  v/2  to  cp^  is  only  a factor  (1  + Q2)  x j 
which  produces  a factor  (1  + £ )~V2  in  D.  The  advantage  is  entirely 
inconsequential  except  when  x is  close  to  1.  Now  x can  be  close  to  1 
only  when  R is  small  and  f and  g are  almost  equal,  which  is  precisely 
the  situation  in  which  the  condition  (2.40)  may  be  violated.  In  fact  the 
choice  co  = C£q  gives  the  maximum  advantage  of  a factor  2~1/2  in  D,  only 
when  the  interferometer  is  operating  in  a state  of  almost  complete  de- 
structive interference  with  a very  low  counting  rate.  The  regime  of 
strong  destructive  interference  is  highly  unstable  against  departure  of 
the  amplitudes  f and  g from  equality,  or  against  increase  in  the  inco- 
herent background  R.  So  we  conclude  that  the  regime  cp  = * is  not  a 

o 

good  place  to  operate,  and  the  additional  factor  2_1/“  in  D is  not  in 
practice  attainable.  The  interferometer  will  operate  stably  at  cp  = rr/2 
and  give  a value  of  D close  to  the  attainable  minimum.  Also  the  condi- 
tion (2.40)  then  disappears.  So  we  shall  assume  in  all  the  later  dis- 
cussion that  interferometers  are  operated  in  quadrature,  with  the  inter- 
fei ing  signals  held  at  a phase  difference  of  (-/2). 

The  simple  interferometer  with  cp  = v/2  gives,  by  Eq.  (2.41), 


2 2 

where  ^ f . "2  = R are  the  counting  rates  of  the  two  coherent  beams 
in  photons  per  second,  and  n is  the  total  counting  rate  n = n + n + R 
The  estimate  (2.20)  for  D becomes 
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r 2 , ,1/2 

D < [Va  n/2n  n , 

1 2 


(2.47) 


The  condition  for  satisfactory  control  of  s(t)  is  D«  1,  which  means 


Mint (n^/y) 


(n2/y)l  » (1  + r)  a 


(2.48) 


where  r is  the  ratio  of  background  R to  signal  (n^  + n^ ) . Equation  (2.48) 

states  that  each  of  the  two  interfering  beams  must  deliver  a number  of 

2 

photons  greater  than  (1  + r)  a within  the  typical  period  of  fluctuation 

of  the  atmosphere.  If  we  take  as  typical  of  atmospheric  dit.  urbances 
3 -1 

Y = 10  sec  and  a = t radians,  and  assume  a simple  interferometer  with 

r = 0,  the  condition  (2.48)  requires  each  of  the  interfering  beams  to 

5 

contain  at  least  10  photons  per  second.  The  result,  Eq.  (2.48),  is 
basic  to  the  understanding  of  the  image-improvement  problem.  It  defines 
a limiting  light  intensity  above  which  all  the  sensors  of  an  improvement 
system  must  operate  if  they  are  to  achieve  effective  control  of  phase 
errors. 
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t 


L 


III  DEFINITION  OF  A FEEDBACK  PIOGRAM 
FOR  A SINGLE-IMAGE  OPTICAL  IMPROVEMENT  SYSTEM 

Tho  remaining  soctior s of  this  report  contain  a detailed  description 
and  optimization  of  the  single-image  optical  improvement  system  whoso 
general  nature  and  purpose  were  discussed  in  Section  I.  A system  of  tho 
same  type  has  also  been  described  by  Muller  and  Buffington  (1974),  who 
have  explored  its  capabilities  with  computer  simulations  and  are  pro- 
ceeding with  the  construction  of  an  experimental  model. 

The  basic  idea  of  Muller  and  Buffington  is  to  measure  in  real  time 
the  quan* ity 

S(t)  - Ji(y.t)  IQ(y)  dy  , (3.D 

where  I(y,t)  is  the  actual  brightness  at  tho  point  y of  the  imago,  and 
IQ(y)  is  tho  brightness  of  an  ideal  comparison  image.  They  call  S(t) 
the  sharpness"  of  the  image.  They  prove  that,  under  certain  specified 
conditions,  an  image  that  maximizes  S(t)  also  provides  a faithful  picture 
of  tho  object.  They  use  S(t)  both  as  the  criterion  of  performance  of  the 
system  and  as  the  sensory  input  to  their  feedback  system.  Their  program 
is  designed  to  hunt  for  the  maximum  of  S(t),  with  no  other  information 
than  S(t)  itself  to  determine  the  curve  of  pursuit.  Since  the  hunt  takes 
place  in  a multidimensional  space  of  mirror  displacements  amid  a back- 
ground of  rapidly  fluctuating  atmospheric  perturbations,  it  is  important 
to  make  the  strategy  of  the  hunt  as  efficient  as  possible. 

The  essential  difference  between  the  system  described  in  this  re- 
port and  the  Muller-Buf f ington  system  is  that  we  here  separate  the  assess- 
ment function  of  S(t)  from  the  hunting  function.  We  still  use  tho  quantity 


1 
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k(t),  or  more  accurately  the  similar  quantity  defined  by  Eq.  (4.1)  below, 
to  assess  the  performance  of  the  system.  But  we  allow  all  the  information 
contained  in  the  image-distribution  I(y,t),  and  not  only  the  single 
number  S(t),  to  be  used  as  sensory  input  for  the  hunting  proKram.  Our 
stra teBy  for  the  hunt  is  to  take  the  very  Bencral  feedback  proBram  de- 
fined by  Eq.  (3.8),  and  to  optimize  it  usinB  S(t)  as  the  criterion  of 
success.  To  use  a pictorial  analogy,  Muller  and  BuffinBton  are  in  the 
situation  of  a man  trying  to  stay  near  the  top  of  a multidimensional 
mountain  in  terrain  wracked  by  constant  earthquakes,  with  no  other  sen- 
sory data  than  the  reudinBs  of  a pocket  altimeter.  We  allow  the  man  to 
use  his  ey  ; to  obtain  a partial  view  of  the  terrain  surrounding  him. 

We  pay  a price  for  usinB  a much  larger  sensory  input,  in  that  our 
huntinB  program,  Eq.  (3.8),  is  restricted  to  be  linear  in  the  input  data 
I (y * t ) • Tho  ‘-lass  of  nonlinear  programs  is  too  larBc  for  us  to  find  the 
optimum  in  it  by  analytical  methods.  The  best  wo  can  do  is  to  find  the 
optimum  linear  program.  Muller  and  BuffinBton  use  a nonlinear  program 
to  make  the  best  use  of  their  single  input  S(t).  So  it  is  not  necessarily 
t1^  that  our  optimum  program  is  better  than  theirs.  Both  hunting  stra- 
tegies should  be  tested  in  operational  systems  to  see  which  works  best. 

In  Section  VI  of  this  report  we  examine  briefly  the  extension  of  our 
analysis  to  include  some  nonlinear  programs. 

The  starting-point  of  our  analysis  is  the  equation 
(Kq,>))p  = JdO  (q/2  L)2  O(q,0) 

j dx  exp  iqx  • (0  - y/L)  + iq  € (x ) ] ' 2 , 2) 

relating  the  statistical  expectation  value  of  the  flux  I(q,y)  of  photons 
of  wavenumber  q arriving  at  a point  y of  the  image  plane  to  the  flux 
0(q,9)  of  photons  incident  from  the  direction  0 in  the  sky.  We  take  0 
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to  be  a 2-dimensional  vector  defined  by  the  components  of  the  photon 
direction  normal  to  the  optical  axis.  L is  the  focal  length  of  the 
system  and  x is  integrated  over  the  mirror.  The  error  term  e (x)  rep- 
resents the  displacement  of  the  wavefront  of  a photon  reflected  from  the 
mirror  at  the  point  x.  We  suppose  that  e (x)  is  a superposition  of  effects 
arising  from  atmospheric  inhomogenoitios  in  the  light-path  and  from 
displacement  of  the  mirror  surface.  The  use  of  Eq . (3.2)  implies  two 
basic  approximations.  First,  there  is  the  geometrical-optics  approxima- 
tion of  representing  the  entire  effect  of  atmospheric  refraction  by  a 
change  of  phase  of  the  wave  amplitude.  Second,  there  is  the  narrow-field 
approximation  of  replacing  the  displacement  e(x,q,0),  which  should  depend 
on  the  photon  parameters  q and  0 , by  a function  of  x alone.  The  width 
of  field  over  which  image  improvement  is  possible  is  defined  precisely  by 
the  range  of  directions  0 over  which  the  ©-dependence  of  e can  be  ne- 
glected. The  neglect  of  the  q-dependence  of  e means  that  we  are  treating 
the  atmosphere  as  nondispersivo . 

The  formula  (3.2)  belongs  to  classical  electromagnetic  theory.  It 
describes  the  expected  value  of  the  flux  of  photons,  ignoring  quantum 
fluctuations.  To  take  into  account  the  quantum  nature  of  light,  we  must 
compute  quantities  such  as 

K(y,t,y',t')  = { I (y , t ) I(y',t')> 

P 

- 0<y»t))  <I(y',t')>  , (3.3) 

P P 

following  the  method  of  Appendix  A.  In  the  calculation  we  take  into 
account  the  correlation  between  phr con  statistics  at  different  times 
produced  by  the  feedback  circuit.  The  light  is  treated  as  a rain  of 
incoherent  particles,  the  number  of  particles  arriving  in  each  patch  of 
the  image  Ting  a random  variable  with  a Poisson  distribution.  We 
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completely  ignore  the  subtler  effects  of  partial  coherence  in  bilinear 
express  ions  such  as  Eq,  (3,3)— —the  effects  that  were  exploited  by  Hanburyf 
Brown  and  Twiss  (1958)  in  the  design  of  their  intensity  interferometer. 

It  seems  unlikely  that  Hanbury-Brown-Twiss  effects  could  be  significant 
in  the  present  context,  but  the  point  should  perhaps  be  investigated. 

The  wavefront  displacement  e(x,t)  is  assumed  to  bo  composed  of 
throe  parts, 

e (x, t)  = a (x, t ) + b(x,t)  + c(x,t)  . (3.4) 

Hero,  a (x , t ) is  the  unknown  atmospheric  effect  that  we  desire  to  com- 
pensate, b (x , t ) is  the  mirror  displacement  (multiplied  by  2)  which  we 
try  to  make  equal  and  opposite  to  a(x,t),  and  c(x,t)  is  a high-frequency 
tremor  thM  we  apply  to  the  mirror  in  order  to  sense  the  mirror-image 
interaction.  The  function  of  the  tremor  is  in  some  respects  analogous 
to  that  of  the  tremor  of  the  human  eye,  which  provides  the  brain  with 
ac  feedback  signals  facilitating  the  precise  control  of  the  movement  of 
the  eyeball.  The  frequencies  of  the  tremor  are  assumed  to  be  in  the 
range  of  tens  of  kilohertz  or  higher,  veil  separated  from  atmospheric 
frequencies.  We  assume  lor  the  tremor  movement  the  form 

c(x,t)  = r p (x)  c (t)  , (3,5) 

j J '1 

where  the  index  j denotes  a patch  or  normal  mode  of  the  mirror,  and  the 
c j ( t ) corresponding  to  different  j are  oscillating  at  well  separated 
frequencies.  We  suppose  the  c^.(t)  normalized  so  that 

Av[c  (1)  c (t)  ] = £ 
j k jk 

when  averaged  over  a tim-  comparable  to  the  atmospheric  fluctuation 
periods.  Then  Eq.  (3.5)  implies 
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(3.6) 


Av[c(x,t)  c(x',t)] 


^(x,*')  = Z P (x)  p (x') 
j j j 


f 


and  the  covariance  function  F(x,x')  defines  the  statistical  properties  of 
the  tremor.  We  arc  free  to  choose  F(x,x')  in  any  convenient  fashion, 
subject  to  the  limitation  that  the  tremor  amplitude  should  be  a small 
fraction  of  an  optical  wavelength.  For  definiteness  we  restrict  the 
amplitude  by  assuming 

r,  . „ 2 

F(x,x)  < X , / o 7\ 


where  2r*X  is  an  average  photon  wavelength. 


Tac  feedback  from  image  to  mirror  is  defined  by  a general  bilinear 


expression 


b(x , t) 


N(x,x',y,t  - t')  c(x',t')  [ 1 ( y , t ') 


iL(y,t')] 


(3.8) 


with 


Ky.t) 


Hq,y,t) 


(3.9) 


and  N ( x , x ' , y , t ) a kernel  whose  values  are  programmed  into  the  computer 
that  controls  the  mirror.  In  Eq.  (3.8)  we  subtract  from  the  momentary 
image  I(y,t)  the  long-term  average  image  defined  by 


IL(y,t) 


^00 


Hy,t')  dt' 


(3.10) 


where  (t  ) is  a function  extending  over  many  seconds,  containing  only 
frequencies  much  lower  than  the  relevant  atmospheric  ferquencies.  The 
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normalization  condition 

^ (t ) d"  " 1 (3.11) 

ensures  that  the  feedback  b(x,t)  responds  only  to  atmospheric  fluctuations 
of  the  image  and  not  to  the  constant  part  of  the  image.  The  photon 
fluctuations  in  I will  be  negligible  in  comparison  with  those  in  1. 

The  physical  meaning  of  Eq.  (3.8)  is  that  the  computer  calculates  the 
correlation  between  the  mirror  tremor  at  x'  and  the  image  response  at  y, 
and  uses  the  result  to  adjust  advantageously  the  displacement  of  the 
mirror  at  x.  Our  main  problem  is  to  determine  the  optimum  choice  of  the 
kernel  N.  Note  that  we  have  I(y,t)  and  not  (Ky.t))  on  the  right  side 

of  Eq.  (3.8),  so  that  the  effects  of  quantum  statistics  are  included  in 
the  ^eedback. 

The  most  reliable  w->y  to  proceed  from  this  point  on  would  be  to 
carry  out  an  accurate  computer  simulation  of  the  response  of  the  mirror 
to  an  imposed  a(x,t)  derived  from  some  model  of  the  atmospheric  fluctuations, 
using  the  nonlinear  feedback  equation  (3.8)  without  further  approximation. 
Instead  of  this,  since  our  aim  is  a qualitative  understanding  of  the  sys- 
tem rather  than  numerical  accuracy,  we  immediately  linearize  Eq.  (3.8)  by 
treating  all  the  displacements  c(x,t)  and 


s(x,t)  = a (x , t)  + b(x , t) 


(3.12) 


as  small  compared  with  an  optical  wavelength.  We  require 

s(x,t)  < \ , (3.13) 

in  addition  to  Eq . (3.7).  This  requirement  means  that  the  feedback 
system  must  be  successful  in  keeping  the  sum  s(x,t)  small,  even  when 
a(x,t)  and  b(x,t)  are  individually  large.  Our  procedure  is  self-consistent 
only  if  in  fact  Eq.  (3.13)  is  satisfied  after  the  optimized  kernel  N is 
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chosen.  The  criterion  of  optimization  will  be  determined  with  this  con- 
dition in  mind. 

The  linearized  form  of  Eq.  (3.8),  obtained  by  expanding  the  exponen 
tial  in  Eq.  (3.2)  and  using  Eq . (3.6),  is 

<b(x,t))p  = J dt'Jdx'  r(x,x/,t  - t7)  (s(x/,t/))p 

(3.14) 

Because  Eq.  (3.2)  is  used,  we  can  obtain  a linear  equation  only  for 
(b)  and  not  for  b itself.  The  kernel  T is  given  by 

r(x,x/,T)  = Jdq(q2/2^TL)2  d§[F(x"  ,§)  - F(x7,§)] 


2 Re{M(q,x7  - x/7  )L[x,|,q(x7  - x77),t]} 


w he  re 


M ( q , z ) = h ,9)  exp(iq0  • z) 


is  the  Fourier  transform  of  the  object,  and 


(x,x7,z,-)  =jN(x,x',y ,t)  exp(-iy  • z/L) 


(3.15) 


(3.16) 


(3.17) 


is  the  transform  of  the  feedback  kernel.  The  subtracted  term  I in  Eq. 

L 

(3.8)  has  disappeared  from  Eq.  (3.15),  because  the  tremor  signals  vanish 
when  averaged  with  $(t  - t7).  For  Eq.  (3.14)  to  hold,  we  require  in 
addition  a consistency  condition 

Jdq(q/2'tL)2q  fffjtx'  dx7'd*dT  [F(x77  ,§)  - F(x7,§)] 


Imf  M(q  , x 7 - x 77  ) L[  x , § ,q  (x  * - x77  ),t]}  = 0 , 


(3.18) 


r^r 


K 


which  ensures  that  zero  stimulus  s(x,,t/)  produces  zero  response  b(x,t). 
If  we  write  Eq.  (3.14)  in  symbolic  form  as 


(b)  = r(s>  = r (a  + (b>) 
we  can  write  the  solution  as 

( s)  = Qa  , Q = [ 1 - r] 


In  longhand,  Eq.  (3.20)  means 

, t 


<s(x,t))p  = j dt'Jdx'  Q(x,x/,t  - t')  a(x/,t/) 


(3.19) 


(3.20) 


(3.21) 


with  the  kernel  Q defined  by  inverting  the  operator  (1  - ~). 

There  is  a close  formal  analogy  between  the  feedbark  system  defined 
by  Eq.  (3.8)  and  the  interferometer  described  in  Section  II.  One  point 
that  is  not  obvious  is  the  analogy  between  the  use  of  a sensing  tremor 
c(x,t)  and  the  choice  of  phase  in  the  interferometer.  The  image  in- 
tensity I(y,t#)  is  a sum  of  contributions  of  the  form 

exp{  iq  [s(Xi)  + c^)  - s(x,2)  - c(x  )]}  , (3.22) 

from  pairs  of  points  (x  ,x  ) on  the  mirror.  The  dc  signal  I(y,t')  with 
c(x)  = 0 is  analogous  to  an  interferometer  output  with  phase  cp  = 0.  When 
we  add  the  tremor  c(x)  and  pick  out  the  ac  signal  from  I(y,t')  by  forming 
the  expression  (3.8),  wo  are  replacing  Eq.  (3,22)  a contribution  pro- 
portional to 

i expfiq  [s(Xi)  - s(x2)]}  = expf  i [qsfx^  - qs(x2)  + £]} 

(3.23) 
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Tho  ac  signal  is  thus  analogous  to  the  output  of  an  interferometer  with 
c.  * "/2.  The  consistency  condition  (3.18)  is  required  in  order  to  make 
the  phase  relation  cc  - -V2  exact.  We  can  therefore  expect  that  all  the 
results  of  Section  II  have  analogs  for  tho  system  described  in  this 
sect  ion. 

To  complete  the  definition  of  tho  system  we  need  a statistical  model 
of  the  atmosphere.  It  is  unnecessary  to  specify  tho  behavior  of  the 
atmospheric  displacements  a(x,t)  themselves,  which  are  poorly  defined 
and  subject  to  irrelevant  large-scale  fluctuations.  Only  differences 
between  tho  a (x,  t ) are  observable.  Wo  therefore  siiocify  the  atmosphere 
by  means  of  the  covariance  function 

( f a (x , t ) - a (x  , ,t  ' ) ~)  = l(x  - x\t  - t')  . (3.21) 


\ery  little  is  known  observat  ional  ly  about  U(f,  ).  Since  the  high- 
altitude  inhomogeneities  are  usually  drifting  with  the  wind,  the  space 
and  time  dependences  of  U(§,~)  will  in  general  be  strongly  correlated. 
No  universal  formula  for  U(|,t)  is  likely  to  be  valid  on  all  occasions. 
Nevertheless  it  is  helpful  to  have  some  definite  mode  1 as  a b; sis  for 
discussion,  and  for  this  purpose  we  use  tho  formula 


L'<5,0) 


(3.25) 


deduced  by  Tatarsky  (1968)  from  a theoretical  study  oi  atmospheric  tur- 
bulence. The  length  p defines  the  seeing  patch  size,  or  smallest  scale 
of  optically  disturbii  . luctua t ions . The  angular-resolution  limit  set 
by  the  seeing  is  of  the  order 


* ~ (2  ^P>  • (3.26) 

Under  conditions  of  goixl  seeing,  a is  about  1 second  of  ur<  and  p lies  in 
the  range  from  10  to  20  cm. 
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IV  CRITERION  FOR  OPTIMIZING  THE  SYSTEM 


V 


Wo  use  as  criterion  of  overall  performance  of  the  system  o,  Section 
III  tho  quantity 


te(x)  - e(x')l  dx  dx' 


(4.1) 


Which  is  a measure  of  the  »o„„-sqUaro  distortion  „f  the  wavefront  over 
tho  whole  mirror.  The  criterion  (4.1),  eontrast  to  Dr.  Muller’s 
criterion  (3.1),  Ca„„„t  bo  dlrecUy  measured  by  monitoring  the  imago. 
Muller's  choice  of  (3.1)  is  constrained  by  practical  necessity,  since 
his  system  works  by  monitoring  s(t)  in  teal  time.  For  us  tho  criterion 
(1.1)  is  better,  since  it  defines  more  sharply  the  characteristics  that 
should  be  maximized  „ good  imago,  and  we  are  no,  here  discussing  a 
system  in  which  S is  measured  and  maximized  in  real  time.  We  are  using 


S2  only  as  a tool  for  the  theoretical  assessment  of  the  overall  perform- 


ance of  a system  that  is  specified  independently  of  s.,.  using  s we 
take  a "Cod's-eye  view”  of  the  system,  calculating  tho  degree  of  con- 
sonance between  the  actual  wavefront  and  the  perfect  wavefront  tha,  is 
invisible  to  mortal  eyes. 


W°  "OXt  h3Ve  t0  Ch°°Se  3 Standard  of  comparison  to  define  how  small 


S2  should  be  to  give  acceptable  performance.  A suitable  criterion  is 


« Sx,  where  is  the  expression  obtained  from  Eq.  (4.1)  by  taking 
the  phases  qc (x)  to  have  everywhere  the  root-mean-square  value  of  1 
radian , with  c (x)  and  e(x')  uncorrelatod  when  x / x'.  The  condition 

S2  ' Si  then  mojns  that  the  phase  differences  between  different  points 
nn  the  mirror,  which  it  is  the  purpose  of  the  system  to  control,  are  in 
fact  controlled.  To  define  Sl  precisely  we  make  the  replacement 
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2 2 

fe(x)]  -*  \ , e(x)  e (x')  -*  0 


in  Eq,  (1.1) , and  obtain 


2 2 

S = X A 


where  A is  the  area  of  the  mirror.  The  ratio 


“■  <YV 


is  our  normalized  criterion  of  performance,  with  D«  1 for  a good 
diffraction-limited  image. 

».o„  «c  substitute  ,<«>  . s(x>  ♦ c(x>  In  E„.  (1.1),  and  nwneo 
over  atmospheric  and  photon  statistics,  the  terms  in  s(x)  and  c(x> 
separate , and  we  find 


D = D + D 
S c 


(4.4) 


Ds  - (1/s 


.■  I ff 


dx  dx'  (ts(x)  - s(x/)]“) 


(4.5) 


uhile  Dc  reduces  by  Eqs.  (3.6)  and  (4.3)  to 


D = F n' 

c o 


F = A 
o 


'7pfa- 


x)  dx 


(4.6) 


The  condition  D < 1 thus  includes  Eq.  (3.7)  automatically.  The  main 

task  of  the  statistical  analysis  is  to  compute  Dg.  For  this  purpose 

linear  relations  such  as  Eqs.  (3.14)  and  (3.21)  are  insufficient.  We 

need  to  grapple  with  photon-photon  correlations  expressed  by  quadratic 

quantities  such  as  Eq.  (3.3).  The  details  of  the  calculation  of  D are 

s 

given  in  Appendix  D.  Tnc  result  is 
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Ds  = dx'ffdYX2 


jj  dtidt^[Q(x,xi,t  - ti>  - 


Q(x,.xi>t  - t^] 


[Q(x,x2>t  - tg)  - Q(x',xQ,t  - t2>] 


| — U ( x - x,t  - t ) 

| 2 1 2’  1 2 


////< 


dz  dz'  dydt'  F(z,z')  I (y,t') 


N(xi>z,y,ti  - t')  N(x9,z/,y,t9  - t')j 


(4.7) 


The  meaning  of  this  formidable  expression  may  become  clearer  if  it  is 

compared  with  the  corresponding  expression  (2.10)  for  the  mean-square 

fluctuation  of  a simple  linear  servosystem  in  Section  II.  The  two  fac- 

, _2 

tors  Q in  Eq.  (4.7)  are  exactly  analogous  to  the  factor  1 - BN(x)  in 
Eq.  (2.10),  Both  arise  by  inverting  the  kernel  of  the  linear  feedback 
equation,  Eq.  (3.14)  or  (A-3)  respectively.  The  atmospheric  noise  term 
U appears  in  Eq.  (4.7)  with  the  coefficient  (-1/2)  as  compared  with  the 
corresponding  term  U(x)  in  Eq.  (2.10),  because  the  definitions  (3.24) 
and  (2.1)  are  slightly  different.  The  photon  noise  term  defined  by  the 

last  fourfold  integral  in  Eq.  (4.7)  has  the  same  structure  as  the  term 

■ ~ .2 

A N(x)  in  Eq.  (2.10),  N being  in  each  case  the  feedback  kernel  and  I 

o 

or  A the  rate  of  arrival  of  photons.  Only  the  factor  F in  Eq.  (4.7)  has 
no  analog  in  Eq.  (2.10),  since  it  arises  from  the  mirror  tremor  system 
which  was  absent  in  the  simple  servo. 

The  next  problem  is  to  optimize  the  whole  system  by  minimizing  D. 
The  quantities  that  we  are  free  to  vary  are  the  tremor  autocorrelation 
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function  F(z,z')  and  the  feedback  kernel  N(x,z,y,T).  Both  F and  N occur 
in  a highly  nonlinear  way  in  D,  since  Q is  defined  as  the  inverse  of 
(1  - D , with  T given  by  Eq.  (3.15).  It  might  seem  almost  as  hopeless 
to  minimize  Eq.  (4.7)  by  disentangling  it  analytically,  as  to  search  for 
a minimum  numerically  by  trying  all  functions  F and  N of  two  and  four 
variables.  Fortunately  the  analogy  with  Eq.  (2.10)  comes  to  our  rescue. 

In  the  following  section  we  make  an  approximation  that  we  call  the  Big- 
Mirror  Approximation,"  consisting  ersontially  in  the  neglect  of  effects 
of  the  mirror  edges.  With  this  approximation,  can  be  reduced  to  a 
sum  of  terms  of  precisely  the  form  of  Eq.  (2.10),  and  so  the  theorem  of 
Section  II  can  be  used  to  minimize  it.  The  minimization  can  then  be 
carried  through  analytically  to  the  end,  and  exact  expressions  are  ob- 
tained for  the  optimum  choice  of  the  functions  F and  N.  We  do  not  claim 
that  the  big-mirror  approximation  is  accurate  for  a telescope  of  practical 
size.  Its  purpose  is  not  to  give  numerically  accurate  results,  but  to 
give  a qualitative  understanding  of  the  behavior  of  a real  system.  Thus, 
in  designing  a feedback  system  for  a real  telescope,  we  should  use  the 
exact  formula  (4.7)  rather  than  the  big-mirror  approximation  to  estimate 
its  performance.  But  the  realistic  minimization  of  Eq.  (4.7)  can  be  done 
using  the  big-mirror  forms  of  F and  N as  starting  point.  Allowing  F and 
N to  have  the  shape  indicated  by  the  big-mirror  approximation,  with  one 
or  two  adjustable  parameters  to  fix  the  overall  strength  and  range  of 
the  feedback,  we  should  be  able  to  compute  and  minimize  Eq.  (4.7)  numeri- 
cally, and  so  arrive  at  a design  close  enough  to  the  optimum  for  practi- 
cal purposes. 
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V OPTIMIZATION  OF  THE  SYSTEM 
IN  THE  BIG-MIRROR  APPROXIMATION 


The  big-mirror  approximation  consists  in  imposing  periodic  boundary 
conditions  on  the  wave  amplitudes  at  the  edges  of  the  mirror.  In  realit; 
the  edges  are  free  and  there  are  no  boundary  conditions.  By  imposing 
periodic  conditions  we  grossly  misrepresent  all  effects  arsing  from  the 
edges  of  a real  mirror.  Since  the  edge  effects  may  be  expected  to  become 
less  important  as  the  mirror  is  made  larger,  we  may  hope  that  this  ap- 
proximation becomes  in  some  sense  asymptotically  exact  as  the  size  of  the 
mirror  tends  to  infinity.  In  any  case,  the  approximation  may  be  useful 
even  if  it  is  not  asymptotically  exact. 

For  definiteness  we  suppose  that  the  mirror  is  a regular  hexagon  of 
diameter  d measured  from  edge  to  edge.  Its  area  is 

A = V3  ) d2 

\2  1 ) • (5.1) 

The  periodicity  is  that  of  a regular  triangular  lattice  with  lattice- 
constant  d.  Nothing  essential  in  our  calculation  would  be  changed  if  we 
took  the  mirror  and  the  periodicity  lattice  to  be  square,  only  a hexa- 
gonal mirror  looks  a little  more  realistic. 

To  take  advantage  of  the  periodicity,  it  is  convenient  to  choose  the 
normal  modes  p.(x)  of  the  mirror  tremor  co  be  standing  waves  with  the 
same  period  as  the  mirror.  Each  Pj(x)  is  then  a sine  or  cosine  of  a 
scalar  product  (v  • x).  We  shall  always  use  the  letter  v to  denote  a 
vector  of  the  reciprocal  lattice-constant  (4r3“1/2d_1 ) . For  each  v other 
than  zero,  we  have  a cosine  and  a sine  mode  that  we  suppose  to  have  equal 


I 

\ 


intensity  of  tremor.  Then  the  covariance  function  defined  by  Eq.  (3.6) 
becomes 


F(x,x'  ) = F(x  - x'  ) 


(5.2) 


invariant  under  translation  and  periodic  with  the  periodicity  of  the 
mirror.  We  can  expand  F into  a Fourier  series 

F(z)  = A 1 £ F(v)  exp(iv  • z)  , (5.3) 

v 


and  similarly  for  all  other  periodic  functions.  The  quantity  F defined 

o 

by  Eq.  (4.6)  is  now 


F = F(0)  = A”1  £ F(v) 
o v 


(5.4) 


The  basic  optical  equation  (3.2)  takes  the  form 


<Kq,y)> 

p 


(L2A)_1  r 0[q , (v/q)  + (y/L)] 
v 


dx  exp[ iv  • x + iqe (x)] 


2 


(5.5) 


when  the  wave  amplitude  at  the  mirror  is  made  periodic.  The  ideal  image 
obtained  from  Eq.  (5,5)  with  e (x)  = 0 is 


I (q,L6)  = AL 


0(q,9) 


(5.6) 


Since  the  periodic  mirror  has  no  edges,  there  is  no  diffraction  in  the 
absence  of  phase  errors,  and  so  the  ideal  image  is  u perfect  copy  of  the 
object. 

Since  all  points  on  the  mirror  are  equivalent,  the  feedback  kernel 
in  Eq.  (3.8)  will  also  be  assumed  translation-invariant  and  periodic. 

We  expand  it  into  a Fourier  series  in  space  and  a Fourier  integral  in  time, 
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We  calculate  the  kernel  FCx.x'.t)  as  before  from  Eqs,  (3.8)  and  (5.5). 
We  find  that  f is  also  translation-invariant  and  periodic,  with  Fourier 
transform  given  by 


r(v,,) 


N ( v,  L 0 ,uu ) D2  (G  , v)  dO 


(5.8) 


where  D^G.v)  is  a second-difference  of  the  object  brightness  distribution 
in  the  direction  G , 

D2  (9,v)  = f q2dq  {O[q,0  + (v/q)) 


+ ofq.9  - (v/q)l  - 20 (q , 0 )]  . (5.9) 

The  normalization  of  r(v,x)  is  chosen  as  in  Eq.  (5.7),  so  that  ?(v,x)  is 
precisely  the  eigenvalue  of  the  kernel  r(x,x',  t - t')  acting  on  a plane 
wave  cxp(iv  • x'  - ioi t ' ) . The  inversion  (3.20)  then  gives  immediately 
the  Fourier  transform  of  the  kernel  Q, 

Q(v,x)  = [1  - f(v,x)]  1 . (5.10) 

It  is  easily  verified  that  the  consistency  condition  (3.18)  is  automati- 
cally satisfied  when  the  kernels  F and  L are  translation-invariant. 

We  are  now  ready  to  substitute  the  periodic  kernels  into  Eq.  (4.7), 
which  then  happily  reduces  from  a tenfold  to  a fourfold  integral, 


f 


D = (G/2n)  I IdL  C(v)  |l  - rCv.aj)  | 


[~  2 U^v>u;)  + AF(v)  I I dqd0 


*//- 


O(q,0)  !N(vfL  0 ,ui) ! ] , 


(5.11) 


with  T given  by  Eq.  (5.8), 


2 -1 
G = (\  A) 


(5.12) 


C(v)  = 1 for  (v  ^ 0)  and  C(v)  = 0 for  (v  - 0) 

(5.13) 

The  similarity  in  structure  between  Eq . (5.11)  and  Eq . (2.10)  of  Section 

II  is  now  clear,  and  will  be  exploited  in  the  minimization  of  D . The 

S 

factor  (-1/2)  in  Eq . (5.11)  looks  wrong,  but  is  correct.  The  Fourier 
transform  of  any  realistic  atmospheric  covariance  function — for  exan-ple, 
Eq.  (3.25) — is  in  fact  negative. 

We  optimize  the  choice  of  the  kernels  F(v)  and  N(v,y,u>)  in  three 
stages,  fixing  first  the  y-dependence  of  N,  then  the  (v  ,u))- dependence  of 
N and  lastly  the  v-dependence  of  F.  The  y-dependence  of  N is  simple, 
since  for  any  fixed  values  of  (v  ,uj)  the  kernel  N(v,y,iu)  appears  in  D 

s 

only  in  the  linear  combination  of  Eq.  (5.8)  and  in  the  quadratic  term  of 
Eq.  (5.11).  The  optimum  N must  therefore  be  of  the  form 

N(v,L  G.UJ)  = N(v,uj)  D2(9,v)  [0(9)]  1 , (5.14) 


wi  th 


0(0) 


/' 


dq  0(q  ,9  ) 


(5.15) 
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ihc  recipe  (5.14)  agrees  with  the  intuitive  notion  that  an  optimum  feed- 
back will  emphasize  those  points  y of  the  image  that  correspond  to  places 
of  maximum  contrast  on  the  object.  The  second  difference  D (0,v>  picks 
out  the  places  of  high  contract.  When  Ec, . (5.11)  is  substituted  into 
Eqs , (5.8)  and  (5.11),  the  result  is 


US  ^ 1 - F(v)A(v)  N(v,m) 

{-1/2  L'(v,uu)  + AF(v)A(v)  N(v  ,tu)  2]  , 


(5.16) 


Mv)  = Jd9{  [Dy(6,v)]2  0(G)} 


(5.17) 


£(v)  is  a measure  of  the  contrast i ness  of  the  object  at  angular  scale 
not  greater  than  v , emphasizing  second  differences  of  the  brightness 


distribution. 


We  are  now  ready  to  complete  the  optimization  of  N.  For  every  fixed 
v.  the  integral  in  Eq . (5.16)  is  precisely  of  the  form  of  Eq . (2.10) 
and  can  be  minimized  by  choosing  N(v,uu)  according  to  Eq . (2.11).  Thus, 

(v,“»  is  (,c“termi ned  as  that  function  analytic  in  the  half-plane 
(I mu  >0)  for  which 

1 - F(v)£(v)  N(v,x)  ^ = 1 - (2A)"1  F(v)a(v)  U(v,uu)  (5.18) 

on  the  real  ir-axis.  With  this  choice  of  N,  the  value  of  Dg  becomes, 
according  to  Eq.  (2.11), 


US  = (G/“  ) F(v)^(v)J  C(v) 


log  1 - (2A)  F(v)Mv)  U(v,ui)]  duu 


(5.19) 
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To  proceed  from  this  point  with  -;he  optimization  of  F(v)  for  any  given 
U(v,oj)  would  be  possible  but  tedious.  Instead  we  shall  simplify  the 
algebra  by  making  a physical  assumption  about  the  atmospheric  fluctuation 
spectrum.  We  suppose  that  the  atmospheric  fluctuations  of  spatial  wave- 
number  v decay  exponentially  in  time  with  the  dependence 


cxpf-y(v)  t1  - t2  ] 


(5.20) 


where  Y(v)  is  an  arbitrary  function  of  v.  This  means  that  the  Fourier 
transform  U has  the  form 


U(v,oj)  = -W(v)  2Y(v)  {ou  + [Y  (v ) J ) 


2,-1 


(5.21) 


normalized  so  that  L-W(v)]  is  the  spatial  Fourier  transform  of  the  equal- 
time atmospheric  covariance  function  V(x^  - x^O).  The  assumption  (5.21) 
is  not  completely  general,  but  it  is  also  not  over-optimistic.  In  reality 
it  is  likely  that  the  high-frequency  components  of  the  atmospheric  dis- 
turbance, which  are  the  most  difficult  to  comp^i^ate , will  decrease  with 
,Ji  morc  rapidly  than  the  long-tailed  Lorentz  shape  (5.21)  would  indicate. 

When  U has  the  form  (5.21),  the  optimum  N satisfying  Eq . (5.18)  is 
given  explicitly  by 


N(v,Ui)  - - - i A W(v)  g[z(v)]  [to  + iY  (v)  ] 1 


(5.22) 


wi  th 


-1 


z(v)  = F(v)A(v)W(v)  [AY (v) ] 


g(z)  = 2z  1 [(1  + z)1  2 - 1] 


(5.23) 

(5.24) 


Equation  (5.22)  states  that  the  feedback  that  produces  displacements  with 
spatial  wavenumber  v on  the  mirror  is  applied  with  the  same  exponential 
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time-dependence  (5.20)  as  the  natural  decry  of  atmospheric  fluctuations 
at  the  same  wavenumber.  The  time-dependence  of  the  feedback  is  thus 
completely  determined  by  that  of  the  atmosphere.  The  space-dependence 
of  the  feedback  has  still  to  be  determined  by  choosing  the  optimum  F(v) 
to  substitute  into  Eq . (5.22).  For  U given  by  Eq.  (5.21),  Eq . (5.19) 
reduces  to 

D = — G Z C(v)\V(v)  g [z (v ) . (5.25) 

S 2 v 

To  this  must  be  added 

D = G I F(v)  (5.26) 

c v 

according  to  Eqs . (1.6) , (5.1),  and  (5.12).  For  any  given  set  of  func- 
tions W(v) , Y(v)  describing  the  atmosphere  and  A(v)  describing  the 
contrast-scale  of  the  object,  it  is  not  difficult  to  determine  F(v)  by 
minimizing  the  sum  of  Eqs.  (5.25)  and  (5.26).  With  the  correction  that 
sums  over  v omit  the  term  v = 0,  we  may  drop  the  factor  C(v)  in  Eq . (5.25) 
and  obtain 

9-1 

D = (2A\  ) I W (v ) [g  + (1/1)  zV (v) ] , (5.27) 

v 

with 

(v)  = 8A7  (v)  [£(v)'J  1 [W(v)l  . (5.28) 

It  is  convenient  to  take  g as  the  independent  variable,  using 

(1/ 1)  z = g 2 (1  - g)  , (5.29) 

so  that  g runs  over  the  range  from  1 to  0 as  z runs  from  0 to  ®,  Then 
the  minimum  of  D occurs  when 
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*(v)  = g (2  - g) 


for  V(v)  ^ 1 


(5.30) 


g = i ( z = 0 , for  f(v)  ^ 1 . (5.31) 

The  analysis  thus  gives  a natural  cutoff  to  the  wavenumbers  that  it  is 
desirable  to  compensate  by  feedback.  The  critical  wavenumber  V is  de- 
fined by 


i|r  (V)  = 1 . (5.32) 

For  wavenumbers  v|  > V,  F(v)  = 0 and  there  is  no  feedback.  The  total 
number  of  instrumented  modes  is  then 

n = Mr.)"1  AV2  . (5.33) 

If,  instead  of  using  sinusoidal  waves  to  define  the  modes,  we  divide  the 

mirror  into  patches  and  drive  each  patch  in  the  appropriate  fashion,  the 

optimum  number  of  patches  will  also  be  approximately  equal  to  n.  If  we 

2 

let  each  patch  have  size  P as  defined  by  its  area  (1/1)  n P , then  the 
size  of  patch  determined  by  Eq . (5.33)  is 

p = tv"1  (5.34) 


Since  we  are  supposing  the  number  of  instrumented  modes  to  be  large, 
we  replace  the  sum  in  Eq . (5.27)  by  an  integral.  Assuming  W(v)  and  t(v) 
to  depend  only  on  the  magnitude  of  v and  not  on  direction,  we  obtain 


D = 


(4  nX  ) 


C 

7 


vW(v)  dv[g(3  - 2g )/ (2  - g)1 


(5.35) 


This  integral  divides  into  two  pieces  according  to  Eqs . (5.30)  and  (5.31). 
In  the  low-v  range  it  is  convenient  to  take  g as  variable  of  integration. 
Then 
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D = ( InX 


/ 


vW(v)  Jv 


2 I 2 

+ (2."\  ) dg[(3  - 2p)  (3  - p)/(2  - p)  ] 


v W(v)  (d  lop  v/d  lop  v) 


(5.36) 


To  obtain  numerical  results  we  must  finally  make  specific  choices 
for  the  functions  W(v),  Y(v),  and  A(v).  We  suppose  the  atmospheric 
covariance  function  at  equal  times  to  be  piven  by  a power  law 


U(5  0)  = x2(5/pf 


(5.37) 


of  which  Tatarsky’s  hypothesis  (3.25)  is  a special  case.  W’e  assume 
0 <'  3 < 2 , so  that  the  Fourier  transform  of  U is  nopativo,  and  accordinp 
to  Eq.  (5.21), 


V W (v ) = c(3)  X 2 ( pv ) P 


(5.38) 


c(3)  = 2P+2  sin(l  2 "3 ) [T<  1 + 1/2  3 ) ]2 


(5.39) 


W'e  suppose  the  rate  of  decay  of  atmospheric  fluctuations  to  be  propor- 
tional to  their  spatial  wavenumber;  thus, 


Y (v)  = Ypv  , 


(5.  10) 


where  Y is  the  decay  rate  correspondinp  to  fluctuations  at  the  seeinp- 
patch  size  p.  The  assumption  (5.10)  will  certainly  be  valid  for  fluctua- 
tions that  decay  throuph  beinp  convected  out  of  the  lipht-path  by  the 
wind.  It  is  plausible  that  Eq.  (5,-10)  will  be  approximately  valid  also 
for  fluctuations  decayinp  by  turbulent  diffusion.  Finally  we  must  choose 


-13 


Mv),  which  depends  on  the  detailed  structure  of  the  object.  For  defi- 
niteness we  make  the  simplest  choice,  taking  £(v)  independent  of  v . If 
we  assume  in  the  definition  (5.17)  of  Mv)  that  a certain  fraction  T)  of 
the  light  from  the  object  lias  very  sharp  angular  structure  vhile  the  re- 
mainder is  very  diffuse,  then  we  obtain  the  estimate 


where  is  the  total  flux  of  detected  photons,  and  the  mean  wavelength 
2 A is  defined  by 


-•> 


0(<|,G) 


dqd9 


(5.42) 


We  take  Eq . (5.11)  to  be  the  definition  oi  ",  the  contrastiness  of  the 
object.  Putting  together  Eqs . (5.38),  (5.10),  and  (5.11),  we  obtain 
from  Eq.  (5.28) 


Y (v) 


-1 

N 


(pv) 


5+2S 


(5.  13) 


N = {3  c(g)  “/ 1]  (p,"20/AY)  . (5.44) 

We  write 

6 = (5  + 2S)  1 , C = ? (5  + 23)’1  . (5.45) 

Then  the  optimum  patch  size  defined  by  Eqs.  (5.32)  and  (5.34)  is 

p = ’l’  N ’ • (5.46) 

This  is  the  patch  size  mentioned  in  Eq . (1.2)  of  Section  I.  The  optimum 

patch  size  makes  the  two  terms  in  Eq . (5,36)  of  comparable  magnitude, 
the  first  arising  from  uncompensated  phase  errors  within  the  individual 
patches,  and  the  second  from  compensated  phase  errors  between  different 
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patchet . The  minimum  D is 


D = k ( B ) N 


-e 


(5.47) 


wi  th 


k(S)  = tc(g)/4-] 


1 

[ 

+25  I (3  - 

* 


2g)  (3  - g)  (2  - g)€  2g  3~ dg) 


K 5.48) 


When  S = 5/3,  e = 1/5,  then  c(p)  = 5.6182,  k(g ) = 0.9313.  Equation  (5.47) 
can  also  be  written  in  the  form 


4 2 2 -e 

D = f(£)  [p  ^ o/yd  ] 


(5.49) 


with  the  coefficient 


f (2) 


1 

2 


rc(B)i2 


-e 

k(=) 


(5.50) 


When  £ = 5/3,  then  f(£)  = 0.4805,  and  Eq.  (5.49)  becomes  the  equation 
quoted  in  the  Abstract  [Eq.  (1.1)].  The  fact  that  the  coefficient  f is 

less  than  unity  is  unexpected  good  fortune.  We  do  not  know  how  sensitive 

f may  be  to  the  various  simplifying  assumptions  that  we  have  made. 

One  possible  sensitivity  of  the  coefficients  that  con  easily  ba 
tested  is  their  dependence  on  the  atmospheric  exponent  g.  One  might  sus- 
pect that  the  coefficients  at  g = 5/3  are  fortuitously  small  because  5/3 

lies  close  to  2.  At  g = 2 the  covariance  function  (5.37)  has  a singular 
Fourier  transform  and  k(g)  = c(g)  = 0.  An  atmosphere  with  g = 2 is 
physically  unreasonable  and  the  theory  then  becomes  meaningless.  In 
fact,  however,  the  coefficients  are  uniformly  small  over  the  whole  range 
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° < 3 < 2 for  which  tho  thoo,'y  is  applicable.  A table  of  numerical  value 
is  supplied  in  Appendix  E. 


We  have  specified 
kernel  by  Eqs.  (5.  M) 
dependence  of  F and  N. 


the  y-dependenco  and  x-dependence  of  the  feedback 
and  (5.22),  but  we  have  not  yet  looked  at  the  v- 
It  is  the  v-depondcnce  that  determines  the  spatial 


pattern  of  the  feedback  on  the  mirror.  According  to  Eqs 
(5.29),  and  (5.30), 

F(v)  = i W ( v ) g(l  - g)  (2  _ B)_1 


(5.23),  (5.28), 


(5.51) 


At  long  wavelengths  (small  pv) , we  have 


22  -a_9 

W(v)  - ) p (pv)  i 


(5.52) 


!!(V>  - N'1/3(pv)(1/3><5*25> 

♦ 

by  virtue  of  Eqs.  (5.30)  and  (5.43).  Therefore, 

F(v)  - X2pV1/3(pv)'(1/3)(^1) 

Thxs  means  that  the  tremor  covariance  function  has  the  behavior 

F(x,x  ) - >2N_1/3(jx  - x !/p)"(l/3>(5“3) 
for  separations  x - x i; 


(5.53) 


(5.54) 


(5.55) 


(5.55)  is  (-10/9)  for  3 = 5/3.  This  s 


large  compared  with  p.  The  exponent  in  Eq. 


have  strong  long-range  correlation  of  t 


shows  that  the  optimized  system  must 


remor  amplitudes  at  widely  sepa- 


rated points  on  the  mirror. 

From  Eqs.  (5.22)  and  (5.51)  we  see  that  the  quantity 
J*N(v,x)  dx 


(5.56) 
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t 


has  the  same  v-dependence  as  F(v).  The  spatial  Fourier  transform  of 
Eq.  (5.56)  is  the  feedback  kernel 

N(x  - x'  , y , 0)  , (5.57) 

taken  with  the  time  of  the  mirror  points  x,  x'  infinitesimally  later  than 
the  time  of  the  image-point  y.  This  equal-time  feedback  kernel  therefore 
has  the  same  behavior  as  F(x,x' ), 

N(x  - x'  , y , 0)  - (jx  - x"  I /p)  (1  3^5  ^ f (5.58) 

for  pai?s  of  points  widely  separated  on  the  mirror.  It  is  important 
that  in  setting  up  the  feedback  program  according  to  Eq.  (3.8),  we  include 
a long-range  interaction  feeding  the  tremor  amplitude  at  x'  back  into  the 
mirror  response  at  x.  If  the  Tatarsky  model  atmosphere  is  realistic,  the 
instantaneous  feedback  should  vary  with  the  inverse  (10/9)  power  of  the 
distance.  The  feedback  at  later  times  will  decrease  with  distance  even 
more  slowly. 

We  have  come  to  the  end  of  the  description  of  our  ideal  optimized 
and  linearized  feedback  system.  It  remains  to  discuss  how  relevant  this 
description  may  be  to  the  requirements  of  the  real  nonlinear  and  non- 
periodic world. 


VI  PRACTICAL  CONS I DERATIONS , EDGE  EFFECTS, 
AND  NONLINEAR  SYSTEMS 


The  most  obvious  deficiency  of  the  periodic  big-mirror  approximation 
is  that  it  does  not  allow  the  mirror  as  a whole  to  tilt.  A tilt  of  the 


mirror  produces  a wavefront  displacement  e (x)  that  is  a linear  function 
of  x.  HUcn  we  identify  opposite  edges  of  the  mirror  we  exclude  the  pos- 
sibility of  a linear  component,  cither  in  the  atmospheric  waveiront  dis- 
placement a(x)  or  in  the  mirror  response  b(x). 

In  the  real  world,  atmospheric  disturbances  have  important  linear 
components  that  produce  displacement  of  the  optical  image  without  dis- 
tortion. In  small  telescopes  these  wandering  displacements  of  the  image 
nre  the  main  component  of  "seeing."  The  wandering  motions  can  be  cor- 
rected by  a servo-controlled  tilt  of  the  mirror.  An  image-centering 
system  to  remove  this  component  of  seeing  was  built  and  successfully 
applied  to  planetary  photography  by  Leighton  (1956), 

The  importance  of  the  mirror  edges  in  a real  telescope  is  shown 
most  clearly  when  we  use  as  criterion  of  image  quality  the  expression 


s'  = JJdci  dyf  I (q  ,y  ) - ^(q.y)]* 


(6.1) 


instead  of  Eq . (-1.1).  This  expression  looks  at  first  glance  roughly 
equivalent  to  Eq.  (4.1).  HTien  we  expand  s'  to  second  order  in  the  wa 
front  displacements,  we  obtain,  instead  of  Eq.  (4.1), 


S'  =11 dq  dz(q  /2tL)“  j M(q ,z) | 2[ j (Z)]J 


(6.2) 


Preceding  page  blank 
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j(z)  = / dx  e (x)  r\(x  - z)  - x(x  + z)] 


(6.3) 


whore  X(x)  is  defined  to  be  1 when  x is  a point  on  the  mirror  and  0 
otherwise.  The  expression  (6.2)  consists  entirely  cf  edge  effects,  since 
for  fixed  z the  integral  (6.3)  brings  contributions  only  from  points  x 
belonging  to  two  crescent-shaped  regions  within  a distance  |z|  from  the 
mirror  edge.  When  we  make  the  big-mirror  approximation  and  abolish  the 
edge,  X (x)  = 1 everywhere  and  Eq.  (6.2)  is  identically  zero.  In  the  big- 
mirror  approximation  the  first  non-vanishing  terms  in  s'  are  of  fourth 
degree  in  c (x)  and  cannot  conveniently  be  used  for  optimization.  But 
the  second-degree  terns  (6.2)  are  nonzero  for  a real  mirror.  The  edge 
effects  represented  by  Eq.  (6.2)  are  nonexistent  in  the  big-mirror  ap- 
proximation but  constitute  the  leading  source  of  image  degradation  in 
real  telescopes. 

To  explore  the  problem  of  edge  effects  in  a slightly  more  adequate 
fashion,  we  assume  that  the  mirror  is  verj  large  but  no  longer  periodic. 

We  assume  that  the  mirror  diameter  d is  large  compared  with  all  values 
of  z contributing  importantly  to  the  integral  (6.2).  Then  all  points  x 
contributing  to  Eq.  (6.3)  lie  close  to  the  mirror  edge.  We  make  the 
further  approximation  of  taking  e (x)  in  the  neighborhood  of  the  edge  to 

be  a function  only  of  the  azimuthal  angle  cp  of  the  point  x on  the  mirror. 
Then  Eq,  (6.3)  becomes 


j (z)  = 2-d(z  • e ) , 


(6.4) 


e - (2  ) I e (cp)  n (cp)  dcp  , 


(6.5) 


where  n(-p)  is  the  unit  vector  in  the  direction  cp  in  the  plane  of  the 
mirror,  and  e is  a vector  defining  the  centroid  of  the  distribution  of 
e(x)  around  the  mirror  circumference.  When  Eq.  (6.4)  is  substituted 
into  (6.2),  the  result  is 
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ffd  qdy  (a  • grad)  1^ <q ,y) 


(6.6) 


b>  virtue  of  Eq.  (3.2),  where  a is  the  vectoi 
a = (2L/d)  e 


(6.7) 


But  Eq.  (6.6)  is  just  the  value  of  S'  according  to  Eq.  (6.1)  when  the 
image  is  displaced  by  the  small  vector  a without  distortion.  This  dis- 
Placement  of  the  image  could  be  exactly  compensated  by  a rigid  tilt  of 
the  mirror  through  an  angle 


9 = (a/2L)  = (e/d) 


(6.8) 


S°  wo  roach  the  important  practical  conclusion  that,  insofar  as  edge- 
cffccts  can  be  attributed  to  the  azimuthal  distribution  of  wavefront 
displacements  e (x)  within  a narrow  strip  around  the  mirror  edge,  they  can 
be  completely  corrected  by  tilting  the  mirror.  This  is  possible  because 
only  the  dipol.  component  <6.5)  of  the  azimuthal  distribution  of  cM 
contributes  to  image  degradation  as  measured  by  the  criterion  S'  . The 
fact  that  the  dipole  or  tilt  component  of  the  wavefront  displacement  has 

a disproportionate  effect  on  optical  seeing  was  also  demonstrated  in  an 
earlier  analysis  by  Fried  (1965). 

To  summarize  the  foregoing  discussion,  we  may  say  that  there  are  two 
regions  of  the  mirror  for  which  the  general  analytical  machinery  of  Sec- 
tions III  and  IV  reduces  to  a manageably  simple  form.  First  there  is  the 
interior  region,  where  edge  effects  can  be  ignored  and  the  results  of  the 
periodic  big-mirror  approximation  apply  as  described  in  Section  V. 

Second,  there  is  the  extreme  edge,  where  according  to  Eqs.  (6.2),  (6.4), 

and  (6.5)  the  main  effects  of  wavefront  distortion  can  be  compensated  by 
a simple  tilt  of  the  mirror. 


It  is  an  interesting  exercise  to  rarry  through  the  analysis  of 
Sections  III  and  IV  in  detail  to  find  the  optimum  feedback  program  for 
the  control  of  mirror  tilt.  As  in  Section  V,  the  analysis  simplifies 
enormously  when  the  particular  modes  of  motion  under  consideration  are 
isolated  from  other  modes.  For  the  tilt  problem  we  take  into  account 
only  the  two  independent  linear  modes  of  wavefront  displacement.  The 
tremor  covariance  function  and  feedback  kernel  are  then  restricted  to  be 
of  the  form 


F(x,x'  ) = Z 
N(x,x'  , y , t ) 


F 

ij  lj 


/ 

X 

j 

(y,T 


X X 

i .1 


(6.9) 


(6.10) 


where  i,j  - 1,2  refer  to  two  coordinate  axes  in  the  plane  of  the  mirror 
The  optimum  feedback  kernel  has  y— dependence  given  by 


NiJ<y,T)  ■"kNik(T)  fIo(y)J  1 [*2lo<y)/ftyjdyk]  * (6'n) 

The  second  difference  that  appeared  in  Eqs.  (5.9)  and  (5.11)  is  now  re- 
placed by  a second  derivative.  The  expression  (4.7)  for  the  mean-square 
phase  error  reduces  to 


°s  = (G/-  J Tr  |f[l  - ANfcn+r1  C 

tl  - iN(uj)]"1  ru(x)  + A N(x)  A[N(uj)]^]J  , (6.12) 

whore  C and  a are  (2  X 2)  matrices  defining  the  mirror  geometry  and  the 
distribution  of  optical  contrast  in  the  image,  and  l(u;>  is  the  spectrum 
function  of  the  linear  component  of  the  atmospheric  wavefront  disturbance. 
Equation  (6.12)  is  minimized  in  the  same  way  as  Eq.  (5.16),  except  that 
there  are  now  only  two  modes  instead  of  a large  number.  The  final  result 
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of  the  minimization  is 


n f v_1/3  , ,,,(1/3) (5-0) 

D = f N (p/d)  k , (6.13) 

where  f is  a numerical  coefficient  ard  N is  Riven  by  Eq.  (5.44).  We 

have  here  made  the  same  approximations  and  used  the  same  atmospheric 

model  as  in  the  derivation  of  Eq.  (5.47).  Equation  (6.13)  is  exactly  the 

result  that  we  would  obtain,  apart  from  an  unimportant  numerical  factor, 

if  we  picked  out  of  the  sum  (5,27)  two  of  the  longest-wave  modes  with 
. | . -1/2  -1 

wave— number  vj  - 4^3  d . So  we  have  reached  the  comforting  con- 
clusion that  the  tilt  modes  do  not  be  lave  differently  from  the  internal 
modes  so  far  as  accuracy  of  control  is  concerned.  Numerically,  the  mean- 
square  phase  error  (6.13)  is  completely  negligible  compared  with  the 
error  (5.47)  due  to  internal  modes,  for  any  mirror  with  d >-»  p.  The 
analysis  of  the  tilt  modes  has  verified  that  J;he  edge  effects  are  small 
compared  with  internal  effects  for  a large  mirror,  confirming  the  naive 
argument  that  edge  modes  should  be  less  important  because  they  are  less 
numerous.  The  results  of  the  periodic  approximation  in  Section  V are 
thus  confirmed  as  being  valid  for  large  mirrors.  Of  course,  in  a real 
system  one  must  make  sure  to  include  the  tilt  modes  according  to  Eqs, 

(6.9)  and  (6.10)  together  with  tne  other  modes  in  the  feedback  program. 

The  last  problem  we  have  to  discuss  in  this  section  is  the  appearance 
in  the  feedback  kernels  (5.14),  (5.22),  or  (6,11)  of  the  object  intensity 
0(q,9)  or  the  ideal  image  IQ(y).  Neither  0(q,e)  nor  I^(y ) is  accurately 
observable.  How  then  can  we  use  these  quantities  to  define  the  feedback 
program?  There  are  two  ways  around  this  difficulty.  There  are  many 
astronomical  objects — for  example,  planets,  stars,  and  bright-nucleus 
galaxies— for  which  the  intensity  distribution  O(q,0)  is  known  with 
reasonable  accuracy.  In  these  cases  we  are  interested  in  observing  with 
high  resolution  the  fine  details  of  0(q,e),  knowing  its  gross  features 
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quite  well.  We  can  then  define  the  feedback  program  using  the  best 
available  approximation  O(q,0)  to  the  unknown  0(q,Q).  For  example, 
for  a planet  it  would  probably  be  sufficient  to  use  for  O(q,0)  a uniform 
disc  of  the  correct  size  and  spectrum.  A feedback  system  designed  to 
optimize  the  focusing  of  the  gross  features  of  the  object  will  automati- 
cally correct  the  wavefronts  so  that  the  unknown  fine  details  can  also 
be  seen  clearly. 

The  second  way  around  the  difficulty,  if  no  reliable  approximation 
O(q,0)  exists,  is  to  substitute  for  O(q,0)  an  average  I(q,L  fl)  of  the 
observed  image  I(o,Lq)  over  some  previous  period  of  time,  using  an 
averaging  process  of  the  type  of  Eq.  (3.10).  A scale  factor  (AL~2)  con- 
verts from  I (cj , L 0)  to  an  approximate  O(q,0)  in  accordance  with  Eq.  (5.6). 
The  average  may  be  taken  over  a few  seconds  or  minutes  of  observation  so 
that  the  photon  fluctuations  of  I(q,LG)  are  negligible.  The  entire 
feedback  system  with  I substituted  for  0(q,e)  is  no  longer  linear.  It 
has  become  a highly  nonlinear  system,  relying  on  the  bootstrap  effect  of 
an  initially  poor  imago  to  define  a feedback  to  produce  a better  image  to 
define  a better  feedback,  and  so  on.  If  the  bootstrap  succeeds,  the 
iteration  of.  the  feedback  produces  an  average  image  I(q,L  0)  that  is 
close  to  the  true  shape  of  the  object,  and  the  performance  of  the  whole 
system  is  then  adequately  described  by  the  linear  theory  of  this  report. 

If  the  bootstrap  fails,  I (q , L Q)  never  comes  close  to  0(q,6)  and  the 
linear  theory  is  irrelevant.  To  give  a theoretical  criterion  for  the 
success  of  the  bootstrap  lies  beyond  the  scope  of  a linear  analysis.  An 
analytical  treatment  of  the  nonlineal  system  using  the  methods  described 
in  this  report  is  unlikely  to  be  feasible.  To  find  out  whether  the  non- 
linear system  will  lock  itself  into  a good-image  regime  of  operation  or 
not,  we  must  probably  go  to  a full-scale  computer  simulation.  The 
computer  simulation  could  also  answer  such  questions  as  the  following. 
Would  it  be  advantageous  to  use  initially  an  image  I(q,L0)  averaged 
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over  a very  short  period  of  time,  so  that  fortuitous  moments  of  good 
seeing  would  cause  the  system  to  lock  into  the  good-image  regime?  Can 
we  diagnose  a good-image  lock-in  of  the  system  rapidly  enough,  and 
lengthen  the  time-average  in  7(q,L  6)  as  soon  as  lock-in  occurs,  so  that 
subsequent  atmospheric  fluctuations  or  moments  of  bad  seeing  do  not 
cause  the  system  to  come  unlocked?  Tht  se  and  many  other  questions  will 
need  to  be  carefully  explored  before  an  optimum  design  can  be  chosen 
for  a system  using  nonlinear  feedback. 
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VII  GENERAL  OPTIMIZATION  OF  THE  FEEDBACK  KERNEL 


After  the  special  approximations  of  the  last  two  sections  (big-mirror 
in  Section  V,  and  tilt-only  in  Section  VI)  we  now  return  to  the  problem 
of  optimizing  the  general  single-image  system  described  in  Sections  III 
and  IV.  We  shall  obtain  exact  expressions  for  the  feedback  program  of 
an  optimized  system,  independent  of  simplifying  assumptions.  Specifi- 
cally, the  aim  of  this  section  is  to  determine  the  feedback  kernel 
N(x,x'  ,y,t  - t'  ) appearing  in  Eq.  (3.8)  so  as  to  minimize  the  expression 
(4.7),  which  represents  the  mean-square  phase  error  of  the  system,  uith 
the  kernel  Q defined  by  Eqs.  (3.15)  and  (3.20).  It  is  convenient  to 
consider,  along  with  the  expression  Dg  which  has  the  tremor  amplitude 
products  replaced  by  their  averages  according  to  Eq.  (3.6),  the  similar 
expression  D'  which  describes  the  mean-square  phase  error  before  averaging 
over  tremor  amplitudes.  For  the  first  step  in  the  optimization  of  N, 
we  make  D'  a minimum  before  averaging.  The  choice  of  N that  makes  d'  a 
minimum  is  independent  of  the  c(x,t),  and  therefore  makes  the  average  Dg 

a minimum  also. 

We  use  some  of  the  notations  of  Appendix  D to  obtain  a more  compact 
expression  for  D . In  particular,  the  kernels  B(Y,X)  and  N(X,Y)  arc 
defined  by  Eqs.  (D-12)  and  (D-13).  We  introduce  the  equal-time  kernel 


C(X,X'  ) = 5 (t  - tx,  ) [A5  (x  - x'  ) - 1] 


(7.1' 


Then  D'  is  given  by  substituting  from  Eq.  (D-23)  into  (4. 5)--namely , 


Preceding  page  blank 
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D'  = (S 


ir7/// 


JXdX'  dX  dX 
1 2 


C(X,X')  [(1  - NB)"1  (X ,X1>3  [(1  - NB)"1  (X'  ,X2>] 

f.  1 u(X  - X ) f fdY  N(X  Y)  X (X  , Y)  I (Y)]  . (7.2) 

2 12  J 1 2 o 


This  can  be  written 


D'  = (S  )’1  Tr[  ( 1 - bV")"1  C(  1 - NB)"1 


i-  r U + S I X }]  , 

25  O 


(7.3) 


where  the  trace  means  a cyclic  integration  over  all  space  and  time 
variables,  the  adjoint  kernels  are  defined  by 


X (Y , X)  = N (X , Y) 


(7.4) 


and  the  kernel  1 is  understood  to  be  diagonal  in  space  and  time  vari- 
ables. Not  surprisingly,  the  structure  of  Eq.  (7.3)  Is  identical  with 
that  of  the  corresponding  expression  (6.12)  for  the  simple  tilt-only 
model  of  a feedback  system. 

The  condition  for  D'  to  be  stationary  under  variation  of  N is 


4-  4- 

Tr[  (R  B + R NI  ) £X  ] = 0 
1 2 o 


(7.5) 


for  all  6N  satisfying 


6N  ( Y , X ) = 0 for  t < t 

X Y 


(7.6) 
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I 


where  R^  and  R^  arc  kernels  involving  X-coordi nates  only.  As  in  Section 
V,  we  do  not  try  to  solve  Eq.  (7.5)  in  one  step  but  determine  first  only 
the  Y-dependence  of  N.  The  Y-dependence  consistent  with  Eq.  (7.5)  is 


+ - 1 + - 1 4- 

N = KB  (I  ) , N = (I  ) BK 

o o 


(7.7) 


where  K is  a kernel  operating  on  the  X-coordinates.  The  expanded  form 
of  Eq.  (7.7)  is 


N(X,Y)  = B(Y.X')  t 1 ( Y ) ] “ 1 dX' 


(7.8) 


This  means  that  the  feedback  from  image  at  Y to  mirror  at  X is  obtained 
by  applying  the  kernel  K to  the  response-function  of  the  image  at  Y to  a 
mirror-displacement  at  X' . In  Section  VIII  we  give  an  alternative 
heuristic  derivation  of  Eq.  (7.8)  that  may  make  its  pnysical  meaning 
clearer.  The  kernel  N(x,x  ,y,t  - t'  ) in  the  original  feedback  formula 
(d.8)  is  related  to  K bv 


/*  2 2 
(q  /2  I.)  dq 

J M(q,z)  exp(- 


z)  exp(-iq  z • y/I.)  dz 


[ K(x,  x + z,-)  +■  K(x , x - 7. ) - 2K(x,x'  ,-)] 

(7.9) 


Since  B is  an  instantaneous  kernel,  N will  have  the  correct  causal 

behavior  (vanishing  for  t < t ) if  K vanishes  for  t < t ..  Substituting 

*•  ■ X ^ 

from  Eq . (7.7)  into  Eq . (7.3)  we  find 


D'  = (Sj  )_1  Trf  (1  - T'  K^)'1  C(1  - XT'  )_1 


f-1/2  U + KT'K  ]1 


(7.10) 
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with 


T'  = B+(I  )_1  B 


(7.11) 


The  kernel  T'  is  instantaneous,  symmetric,  and  positive-definite.  The 
definition  (D-12)  of  B can  conveniently  be  written  in  the  form 


B(y 


’’ ,x>  ~ Jdx  °2(y,x  " x)  tc(x'>  - c(x)] 


(7.12) 


where 


B2(y,u)  = (2"L) 


-2 


J B2(y/L,k)  exp(-ik  • u)  dk  , 


(7.13) 


and  D2(e,k)  is  the  second  difference  of  the  object  brightness  distribution 
defined  by  Eq.  (5.9).  Equation  (7.11)  then  becomes 


T'  (x. 


’V  = 


If 


dx'  dx'  Z(x.  - 


fclXj)  - c(xi)]  lc(x'2)  - c(x2)]  , (7.14) 

with 

Z‘VV  =/Jy  fVy,rl  “2<V.U2)  . <7.15, 

In  the  analysis  up  to  this  point  we  have  refrained  from  averaging 
over  the  tremor  amplitudes  c(x,t),  and  we  obtained  the  result  (7.9), 
which  is  independent  of  tne  c(x,t).  For  the  second  step  in  the  optimiza- 
tion procedure,  determining  the  (x, O-dependence  of  N which  is  fixed  by 

the  kernel  K,  we  begin  by  performing  the  tremor  averages  using  Eq.  (3.6). 
Then  Eq.  (7.10)  becomes 


°S  (S1)  Trt(l  - TK  ) C(1  - KT)  1 (-  i U + Knct ) ] , 

(7.16) 


60 


with 


T<W  * jjdx i dX2  Z<*1  - *i  • *2  " X2> 


(FCXj.x^)  - FCx^x^)  - f(x1*x2)  + F(x1»x2)] 

(7.17) 


and  7.  given  by  Eq.  (7.15).  Since  T'  is  symmetric  and  positive  definite, 
so  is  T.  The  kernel  (-1/2  U)  cm  be  replaced  in  Eq.  (7.16)  by 


V = (1  - P)  (-1/2  U)  (1  - P) 


(7.18) 


where  P is  the  kernel  replacing  any  function  by  its  average  over  the 
mi rror — namely , 


P(x,x' ) = A 


(7.19) 


A being  the  area  of  the  mirror.  The  replacement  (7.18)  is  allowable 
since 

TP  = PT  = CP  = PC  = 0 , (7.20) 

by  Eqs.  (7.1)  and  (7. It).  The  kernel  V is  positive  definite  since,  by 
Eqs . (3.24)  and  (7.18), 

V( x , x ,t  - t'  ) = 'a  (x,t)  a'  (x'  ,t  ),  , (7.21) 

a'  ( x , t ) = a(x , t)  - A"1  Ja(|,t)  d*  . (7.22) 

Our  problem  now  is  to  determine  the  kernel  K,  subject  to  the  causality 
condition 


K(x,x'  ,t  - t'  ) =0  for  t < t' 


so  as  to  make 


D 


S 


a 


minimum. 


(7.23) 
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In  Appendix  F it  is  proved  that  the  minimum  of  D is  attained  when 
K = -L  , (7.24) 

where  L is  the  unique  causal  kernel  satisfying  the  integral  equation 

L 4-  L+  + LTL4  = V . (7.25) 

The  value  of  the  minimum  is 

D.  = (S  )_1  Tr[C(L  4-  L+)]  . (7.26 

S X 

If  we  denote  by  Tr  a cyclic  integration  over  the  space-coordinates  only, 
x 

Eq.  (7.26)  can  be  written 

D = (S  )_1  Tr  [CL(04-)]  , (7.27) 

S 1 x 

where  L(04-)  means  the  limit  of  the  kernel  L(x,x'  ,t  - t'  ) as  t -»  t'  from 

above.  When  we  use  the  definition  (4.2)  of  , and  add  to  the  term 

D representing  the  mean-square  phase-error  due  to  the  tremor,  we  obtain 
c 

for  the  total  phase-error, 

D = (\2 A2 )~ 1 Tr  ( C[ F 4-  L(04-)]}  . (7.28) 

x 

The  prescription  (7.7)  together  with  Eqs.  (7.24)  and  (7.25)  gives  a 
complete  formal  solution  to  the  problem  of  choosing  the  optimum  feedback 
kernel  to  use  with  a given  tremor  covariance-function  F.  The  heart  of 
the  matter  lies  in  the  integral  equation  (7.25),  which  can  also  be  written 

(1  4-  LT)  (1  4 L T)  = 1 4 VT  . (7.29) 

The  solution  of  Eq.  (7.25)  is  equivalent  to  a factorization  of  the  kernel 
(1  4-  VT)  into  a causal  and  an  anti-causal  factor,  with  the  causal  factor 
standing  on  the  left.  The  existence  of  such  a factorization  follows  from 
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an  old  theorem  of  Piomelj  (1908).  The  proof  of  < 

Appendix  F , existence  given  here  in 

provides  a constructive  procedure  by  means  of  which  L 

can  be  computed. 

lf  ,0Th°  analySl8,:f  tMS  SCC,‘™  ~ *°  - analysis  of  Scetfon  V 
Eqs  ( r ' b"!’"1,'r0r  aPPr0’“maU°”"  *»  i-  Particular, 

( .29)  aie  analogous  to  Eqs.  (5.19)  and  (5  iHi 

- „i„  diffOTOnco  botwoon  tho  two  anoiy;os 

mirror  approximation  all  the  kernels  F,  C T V L Lf  ^ 

diagonal  in  the  momentum-repeat  io  ’ SlmUltane°US^ 

entation  and  commute  with  each  other 

whereas  in  the  general  case  thev  do  not  + 

they  do  not  commute.  When  L,  i/,  and  T 
commute  with  each  other  th„  , _ . ana  T 

tner,  the  factorization  (7  29i  k 

PP  — — ■ on  hoth  sides,  and  thi  j h ^ 

rith.  in  Eq.  (5  „„  , : iS  th°  °rlB1"  °f  loga- 

• on  and  L do  not  commute  it  is  not  possible 
to  separate  the  two  factors  on  tho  loft  „f  , 

and  wo  have  , ’ by  “BlnB  loSarithms, 

nave  to  use  the  intepra,  cquafien  (7.25,  lnstoad  Df  „ 

formula  for  L.  explicit 

rrr:  - - ■—  - - 

),  but  is  independent  of  r fHnfi  • 

r st-nt  oi  l (defining  the  critnr^u  r 

the  system)  Tho  h optimizing 

3 Thc  choice  of  c was  arbitrary  and  it  i 

,u„t  ‘,nd  ls  reassuring  to  find 

any  positive-definite  Kernel  c would  do  dually  well  and  would  ,ead 

n - optimized  foedbach  pr.rum,  The  only  rcquiromq„,  that  . 

novo  imposed  on  C nnnv-f  f 

apa,t  from  Positivity,  is  Eci  <7  9m  u, 
that  C is  inrfiff  <7-20),  which  states 

indifferent  to  constnnt  nu 

..  Phases  and  measures  only  phase- 

orences.  The  special  choice  (7  1)  for  c c 
of  9 tri  t-  corresponds  to  the  choice 

2 KlVen  Ec'*  0.1)  for  the  criterion  of  n f 

tilth  this  choice  of  c Eq  (7  28)  H “ 

q.  (7.28)  reduces  to  the  staple  form 
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„ 2 -1 

D = (X  A)  Tr^fF  + L(0+) ] 


<7.30) 


In  deriving  Eq.  (7.30)  we  have  assumed  that  F and  L satisfy  dltions 

FP  = PF  = LP  = PL  = 0 


consistent  with  the  principle  that  a constant 
irrelevant. 


(7.31) 

phase-shift  is  optically 


In  the  analysis  of  this  section  we  have  ignored  the  supplementary 
condition  (3.18).  It  is  easy  to  verify  that  (3.18)  is  not  automatically 
satisfied  when  the  feedback  kernel  N has  the  structure  (7.7).  But  we 
can  always  satisfy  (3.18)  by  choosing  a kernel  K with  the  property 


/ 


K(x,x',  T)  d'  = 0 


(7.32) 


To  make  (7.32)  hold  we  have  only  to  subtract  from  K its  long-term  aver 
in  the  same  way  as  we  subtracted  I from  I in  Eq.  (3.8). 

Li 


age. 
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VIII  HEURISTIC  INTERPRETATION  OF  THE 
OPTIMUM  FEEDBACK  SYSTEM 

Experience  with  the  statistical  analysis  of  random  events  has  led  us 
to  consider  a "chi-squared  test"  as  the  most  efficient  criterion  for 
assessing  the  significance  of  deviations  of  an  observed  distribution  of 
events  from  a theoretical  model.  An  optical  improvement  system  must  use 
as  its  input  data  the  deviations  of  the  observed  photon  intensity  dis- 
tribution I (y)  from  the  ideal  image  intensity  I (y).  It  is  plausible 
that  an  efficient  measure  of  significance  of  the  deviation  of  I from  I 
is  provided  by  the  quantity  "chi-squared,'' 

\2  = fn  (y)]"1  [ I (y)  - I <y>]2  dy  , (8.1) 

J o o 

commonly  used  in  statistical  analysis.  A reasonable  strategy  for  an 

optical  improvement  system  would  be  to  control  the  mirror  displacements 

at  each  instant  in  such  a way  as  to  follow  the  line  of  steepest  slope  in 

2 

the  direction  of  diminishing  \ . 

The  linearized  expression  for  I(y)  in  terms  of  the  mirror- 
displacements  s (x)  is 

I (v)  - I()(y)  ~ j B(y,x)  s(x)  dx  , (8.2) 

with  B defined  by  Eq.  (U-12).  Wo  may  therefore  write 

B(y , x)  = ldl(y)/as(x)]  . (8.3) 

2 

The  gradient  defining  the  line  of  steepest  slope  of  x in  the  space  of 
displacements  s(x)  is 
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(8.4) 


2 2 

grad  X = td\  /os (x) ] 


= 2 Idy  B(y,x)  [I  (y)]  1 [ I (y ) - I (y ) ] 
o o 


or  in  operator  notation 


2 

grad  x 


2 B1  (I 


-1 


(I  - I ) 
o 


(8.5) 


The  natural  definition  of  a feedback  system  in  which  the  mirror- 

2 

displacement  b(x)  tries  to  follow  the  line  of  steepest  decrease  of  x 
would  be 


b 


1 2 4-  -1 

- K grad  X = K B (I  ) (I  - I ) 

2 o o 


(8.6) 


where  K is  a kernel  allowing  the  servo-system  to  integrate  signals  over 
a certain  area  and  over  a certain  time.  The  recipe  (8.6)  is  to  be  com- 
pared with  the  formula 


b = N(I  - I ) 
o 


(8.7) 


defining  the  feedback  kernel  N according  to  Eq.  (3.8).  The  form  of  N 

given  by  Eq.  (8.6)  is  identical  with  our  old  result  (7.7).  So  we  see  that 

the  heuristic  method  of  programming  the  feedback,  based  on  the  gradient 
2 

of  X , gives  the  same  result  as  the  rigorous  optimization  based  on  Eq. 
(4.12).  This  is  a us >ful  check  on  the  consistency  of  our  procedure, 
verifying  incidentally  that  the  treatment  of  photon  statistics  in  the 
derivation  of  Eq.  (4.12)  was  correct. 


If  we  substitute  from  Eq.  (8.2)  into  (8.1)  and  use  Eq.  (7.11),  we 

find 


2 

X 


JQF 


T'  (x , x'  ) 


s(x)  s(x'  ) dx  dx/ 


(8.8) 
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2 

If  the  tremor  amplitudes  are  averaged  in  computing  y , then  T'  is  re- 
placed by  T.  We  have,  therefore, 

T < x . X ) = “ .A2/os(x)  os(x'  )J  , (8.9) 

which  gives  a simple  interpretation  for  the  kernel  T and  explains  why  T 
plays  such  a prominent  role  in  the  optimization  problem.  T in  fact  de- 
fines  the  shape  of  the  valley  around  the  minimum  of  x in  the  multidi- 
mensional space  of  mirror-dirpl acements  s(x).  The  optical  improvement 
system  is  designed  to  hunt  as  efficiently  as  possible  for  the  bottom  of 
the  valley.  The  larger  and  more  rsotropic  the  kernel  T,  the  narrower 
the  valley.  The  mean-square  phase  error  D is  thus  strongly  dependent  on 
T and  decreases  as  T increases. 
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IX  FINAL  OPTIMIZATION  OF  THE  SYSTEM 


Tho  third  and  last  stop  in  tho  optimization  of  the  system  is  to 
ciiooso  the  tremor  covariance  function  F to  make  D given  by  Eq.  (7.32)  a 
minimum.  Both  here  and  in  the  previous  optimization  of  the  f odback 
kernel  N,  we  ignore  any  practical  limitations  imposed  by  discrete  struc- 
ture of  the  mirror.  Wo  consider  the  mirror  not  as  a finite  set  of 
rigidly  moving  facets  but  as  a continuously  deformable  surface  oii.o  which 
any  smoothly  varying  tremor  and  feedback  motions  can  be  programmed.  This 
treatment  of  the  mirror  as  a flexible  continuum  is  consistent  with  the 
way  a practical  system  would  probably  be  constructed.  (The  concept  is 
called  the  "rubber  mirror"  by  Dr.  Richard  Muller.)  Every  deformation  of 
a mirror  can  bo  expressed  as  a sum  of  an  infinite  series  (3.5),  each 
term  in  the  series  being  a suitably  chosen  normal  mode  (a  continuous 
function  on  the  mirror)  multiplied  by  a corresponding  amplitude.  We 
shall  shortly  prove  that  the  optimized  system  has  zero  amplitude  of  tremor 
for  all  except  a finite  set  of  modes.  It  is  a satisfactory  outcome  of 
tho  theory  that,  starting  without  any  assumption  of  discreteness,  it 
lo.' ds  automatically  to  an  optimum  in  which  only  a finite  number  of  discrete 
modes  need  to  be  instrumented.  Wo  can  imagine  a practical  system  in 
which  tremor  and  feedback  motions  near  to  the  theoretical  optimum  are 
produced  by  a driving  program  of  finite  complexity. 

To  minimize  Eq.  (7.30),  it  is  convenient  to  go  back  a step  to  (7,16), 
With  the  replacements  (7.18)  for  (-1/2  U)  and  (7.1)  for  C,  Eq.  (7.16) 
implies 

2-1  . f -i  4- 

D = (\  A)  TrfF  + (1  - GT)  V[  1 - (GT)  1 + (GT)T  (GT)  } , 

(9.1) 

Preceding  page  blank 
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with  G defined  by  Eq.  (F-l).  Since  this  choice  of  G makes  D a minimum 
for  fixed  F,  wo  have 


raD/S(GT)]  = 0 

and  therefore  the  condition  for  an  absolute  minimum  of  D is 


(dD/dF)  = (3D/dF)  = 0 


(9.3) 


In  evaluating  (SD/^F)  wo  hold  (GT)  constant,  so  that  the  term  in  V con 
tributes  nothing,  and  we  are  left  with  the  condition 

TrtG(dT)  G^]  = TrTdF]  , (9.4) 

for  all  allowed  variations  dF  of  F.  it  is  remarkable  that  the  compli- 
cated implicit  dependence  of  G on  F does  not  appear  in  (9.4).  The 
dependence  of  T on  F is  linear  and  is  given  explicitly  by  Eq.  (7.17). 
If  we  regard  the  quantity 


w<vy  - "‘w  w 


- //dxi  ds2  z<n  - V52  - y 


tMxrV  " s<xrxin  t6(vV " 6(vx2>] 


(9.5) 


a kernel  in  the  variables  (;  ,§  ) only,  with  the  variables  (x  ,x  ) 

1 4 12 

as  parameters,  then 


T = IKS  "‘vy  F<vy  ■ 


(9.6) 


II  we  define  a kernel  Y by 


+ . 


Y(X1'X2)  = TrtG  W(xi'x2)  G 1 - 


(9.7) 
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then  Eq.  (9.4)  becomes 


Tr  t(Y  - 1)  dF]  = 0 
x 


(9.8) 


for  all  allowed  variations  dF. 

Which  variations  dF  are  allowed?  Since  F is  a real  symmetric  non- 
negative  kernel,  it  can  be  written  in  the  form 


F(x  ,x  ) = Z F co  (x  ) cs  (x  ) 
12  aaoi  a 


(9.9) 


where  co  (x)  is  some  complete  set  of  normalized  orthogonal  functions  on 
O' 

the  mirror.  The  instrumented  modes  co  have  F > 0,  and  the  uninstrumented 

a a 

modes  have  F = 0.  Since  F satisfies  Eq.  (7.33),  there  is  one  uninstru- 

a 

mented  mode  that  is  constant  on  the  mirror  and  is  given  the  label  o = 0. 
Thus , 


-1/2 

CO  (x)  = A , F = 0 

o o 


(9.10) 


Otherwise  we  impose  no  special  requirements  on  the  cp  . Since  F must 

a 

remain  non-negative,  the  allowed  variations  dF  are  of  the  form 


dF(x  ,x  ) = Z dF  co  (x  ) * (x  ) , 

12  op  a 1 P 2 


(9.11) 


where  dF  = 0 unless  both  F > 0,  and  F > 0.  The  meaning  of  Eq.  (9.8) 

o-p  a p 

is  then  either 


JJ 


Y(x,x/  ) cp  (x)  cp  (x'  ) dx  dx'  = 6 . 

a 5 op 


(9.12) 


or 


F = 0 , or  F = 0 

Or  P 


(9.13) 


for  every  pair  of  indices  (^ , 0). 
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It  is  easy  to  verify  from  Eqs.  (9.5),  (9.7),  (7.15),  and  (7.13) 
that  V is  a completely  continuous  positive-definite  kernel  with  finite 
trace.  Therefore  the  number  n of  instrumented  modes  is  finite.  In  fact 
Eq.s.  (9.12)  and  (9.13)  imply 


nS  Tr  Y 
x 


(9.14) 


This  conclusion  is  satisfactory-  as  far  as  it  goes,  but  unfortunately  Y 

depends  implicitly  on  F.  and  so  Eq.  (9.14)  does  not  give  immediately  a 
numerical  estimate  of  n. 

Instead  of  defining  G by  Eqs.  (F-l)  and  (7.25),  we  may  use  the 
equation 


(1  - G T) ( 1 - GT)  = (1  + VT) 


(9.15) 


giving  G directly  in  terns  of  V and  T.  The  complete  solution  of  the 
optimization  problem  then  consists  in  solving  Eqs.  (9.6),  (9.9),  (9.12), 
(9.13),  and  (9.15)  for  F,  G,  and  T,  given  the  observational  quantities  V 
(describing  the  atmosphere)  and  W (defining  the  optics).  In  principle 
the  quantities  V and  W could  be  measured  and  the  kernels  F,  G,  and  T 
derived  from  Eqs.  (9.6)  through  (9.15)  by  numerical  computation. 

To  pursue  the  optimization  further  by  analytic  methods,  we  make 
simplifying  assumptions  concerning  the  input  quantities  V and  W.  We 
assume  that  there  exists  a complete  family  of  normalized  orthogonal 
functions  (Mx)  on  the  mirror  in  terms  of  which  the  kernels  U(x  - x', 
t ' t')  «ivc"  by  (3.24)  and  B^y,  x - x'  ) given  by  Eq.  (7.13)  are 
diagonal  for  all  values  of  (t  - t' ) and  y respectively.  Since  these 
kernels  are  functions  of  (x  - x'  ) only,  it  is  not  unreasonable  to  assume 
that  they  commute  with  one  another.  Our  assumption  includes  as  a special 
case  the  •'big-mirror  approximation"  of  Section  V,  which  neglected  entirely 
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\ 


the  effects  of  mirror  edges  and  took  the  cp  (x)  to  be  plane  waves.  A 

Of 

more  realistic  choice  for  the  cp  (x)  is  to  assume  that  they  are  eigen- 

Qf 

functions  of  the  Laplacian 


7 cp  (x) 
a 


-d  cp  ( x ) 

Or  0 r 


(9.16) 


with  the  boundary  condition 

(n  • grad)  <P  (x)  = 0 (9.17) 

Or 

at  the  mirror  edges.  For  a circular  mirror  the  cp  (x)  are  then  ordinary 

G 

Bessel  functions.  The  functions  defined  by  Eqs.  (9.16)  and  (9.17)  would 
be  a convenient  choice  for  the  design  of  a practical  system,  and  they  are 
reasonably  close  to  being  eigenfunctions  of  any  smoothly  varying  displace- 
ment kernel  K(x  - x'  ) on  the  mirror.  However,  we  develop  the  theory  for 

a general  family  of  functions  cp  (x),  and  do  not  assume  that  they  satisfy 

G 

Eqs.  (9.16)  and  (9.17),  The  only  special  property  that  we  require  oi  the 

V (x)  is  Eq.  (9.10),  Thus  the  projection  operator  P defined  by  Eq.  (7.19) 
u 

i s 

P(x,x'  ) = CP  (x)  cc  (x'  ) , (9.18) 

o o 

and 

Pep  = 0 for  a ^ 0 . (9.19) 

a 


The  assumption  that  U is  diagonal  in  the  functions  cp  (x)  implies, 

Jr 

by  Eqs.  (7.18)  and  (9.19), 

V(x,x',t  - t'  ) = I V (t  - t')  cp  (x)  “ (x'  ) , (9.20) 

a a a a 

with  or  = 0 omitted  from  the  summation.  We  shall  always  understand  a sum 
over  the  index  a or  0 to  omit  the  zero  mode  unless  it  is  stated  otherwise. 
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As  in  Suction  V',  wo  suppose  that  each  atmospheric  mode  decays  exponentially 
in  time  at  a constant  rate,  so  that 


V (t  - t'  ) r V exp(-'i  t - t'  ) 

a a a 


(9.21 ) 


and  the  behavior  of  the  atmosphere  is  completely  specified  by  the 


parameters  V ,t  , 
a > 


The  assumption  that  is  diagonal  implies 


9}(y»x  - x ) = 7 b (y)  fp  (x)  z (x'  ) , 

“ j 1 J 3 


(9.22) 


( zero  mode  included),  where,  by  Kq.  (7.12), 


_*> 


/■ 


b <>’)  = (2rl.)  |D,}((y/L),k]'c;  (k)  “ dk 


♦ (k)  = (x 


(x)  exp( - i k • x)  dx  . 


(9.23) 


(9.21) 


When  wo  substitute  from  Kq.  (9.23)  into  Kqs.  (7.15)  and  (9.5),  we  find 


= Z.-A  _«P  (x  > ^ (’  ,)  C3,(x.)  w.  (--  ) 
a?  a?  a 1 a 1 o 2 52 

A . = | J h(k , k'  )(  V (k)  “ - - (k)  2) 

'*  JJ  a o 


(9.25) 


( I » (k'  ) “ - l"  (k'  ) 2)  dk  dk' 

fi  o 


(9.2G) 


h(k.k')  = (2-rL)"’1  Jd}1  (y/D.klD^I  (y/L).k'lfI  }(y)]"1dy 


(9.27) 


Equation  (9.6)  then  reduces  to 


T = L F 
aS  a?  ar 


(9.28) 
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whore  T , F are  the  cp  -representations  of  the  kernels  T and  F.  Note 

that  the  right  side  of  Eq . (9.28)  is  an  ordinary  product,  not  a matrix 

product.  Up  to  this  point  we  have  not  assumed  that  F is  diagonal  in 

terms  of  the  same  family  of  functions  cp  that  diagonalize  U and  B 

a 2' 

We  have  to  solve  Eqs.  (9.7),  (9.8),  (9,15),  and  (9.28)  for  F,  G,  and 

T.  If  we  take  F to  be  diagonal  in  the  cp  -representation,  then  T is 

a 

diagonal  by  Eq.  (9.28),  G is  diagonal  by  Eqs.  (9.15)  and  (9.20),  and  the 
nondiagonal  part  of  Eq.  (9.8)  disappears.  We  therefore  obtain  a solution 
with  F given  by  Eq . (9.9).  This  means  that  the  tremor  motions  of  the  mir- 
ror are  imposed  on  the  same  set  of  normal  modes  for  which  the  atmospheric 

kernel  V is  diagonal.  We  write  T for  T , A for  L , define  G bv 

a aa  a aa  a 


G = Z G (t  - t'  ) V (x)  cp  (x'  ) 
a o a a 


(9.29) 


and  introduce  Fourier  transforms  G , V by 

a a 


G (*) 

a 


expdxT ) 


dr 


(9.30) 


so  that  C^Ou)  is  analytic  in  the  upper  half-plane.  Equations  (9.12), 
(9,15),  and  (9,28)  then  reduce  to  either 


/ 


G (x)  2dx  = 2 -(£  )_1 

a a 


or  F = 0 
a 


(9.31) 


1 - T G (x)  2 = [1  + T V (x) j_1 

a a a a 


(9.32) 


T = A F 
a a a 


(9.33) 


The  value  of  D given  by  Eq.  (7.30)  is 
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(9.34) 


D = <>  2A)  1 Z [F  + G CO,  ) ] 
a ' a a 


When  V^(t)  is  an  arbitrary  function,  the  solution  of  Eq . (9.32) 
can  be  obtained  by  the  method  explained  in  Section  II.  When  V (t)  has 
the  special  form  (9.13),  we  find 


a 


V (a)  = 2V  Y [ou2  + V2  _1 

0 Ot  Qt  Qt 


(9.35) 


G (x)  = iV  R [x  4-  i6  ] 

a a a a 


-1 


(9.36) 


G (t  ) s V « exp(-6  'T ) . t > o 

a a a a ’ ' 


(9.37) 


with  the  pirameters  K , 6 , z defined  bv 

a a a 


K = [(1  + z )1  2 - 1]  , 

a a a 


(9.38) 


6 - Y (1  + z ) 

at  a a 


1 2 


(9.39) 


z = (2V  T /Y  ) 
a a a a 


(9.40) 


Equations(9.31)  and  (9.34)  then  become  either 


K/(2  - K )]  = f 
Qt  Qt  Ot 


or  r = 1 
a 


and 


(9.41) 


D = (JlV1  I [F  + V B ] 

a a a a J 


(9.42) 


with 


i !r  = C 2Y  /(A  V )] 

a a a a 


(9.43) 
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The  optimization  procedure 

of  the  mirror  we  have  the  input 

by  Eq.  (9.43),  g by  (9.41),  z 
O'  Qf 


is  now  complete. 

parameters  V , V 
a a 


by  (9.38)  or  by 


For  each  normal  mode 

. £ • Then  V is  given 
cr  ( y 


/ 

O' 


a - k ) 

a 


(9.44) 


* " (9,40)’  6C  ^ (9-39)*  and  Fa  (9.33) . The  Kernels  F,  G,  T are 

‘hereby  determined.  There  is  only  one  important  remark  to  be  added 
Equation  (9. U)  has  a solution  with  0 < g 


1 only  when  < i or 
o 


2 

V A > 2Y 
0/  a a 


(9.45) 


"'hen  Va  * 1,  there  is  no  stationary  value  of  D for  positive  F . and  the 
minimum  of  D is  attained  at  ^ = 0.  The  theory  instructs  us  to  apply 
tremor  and  feedback  only  the  finite  set  of  modes  satisfyin*  Eq.  <9.45, 
The  instrumented  modes  are  the  modes  of  latest  linear  scale,  for  which 
a i-  M.atest  and  least.  The  uninstrumented  modes  are  those  of 
small  scale  and  hieh  frequ  r.cy,  for  .hich  compensation  by  fcedb!lck  „ 
impossible  because  there  are  not  enoukh  photons  available.  The  final 
formula  tor  the  mean-square  phase  error  of  the  system  is 


“ ' <‘"A>  1 V„Ka:<3  - - K )] 


(9.46) 


.here  is  plven  by  Eq.  (9.41,,  the  firs,  alternative  applying,  the 
instrumented  modes,  and  the  second  to  the  uninstrumented  modes. 

»’e  have  now  reproduced  the  results  of  Section  V Up  to  Eq.  (5  31) 

assumptions  that  are  more  keneral  and  more  realistic  than  the  hi*. 

mirror  approximation.  To  translitn 

an. late  f i om  the  notations  of  this  section 

to  those  of  Section  V,  we  have  only  to  substitute 


Q 


a 


i w(V) 


A A XA(v) 

Of 


(9.47) 
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the  remaining  symbols  being  unchanged.  The  results  of  Section  V from 
Eq.  (5.35)  onward,  where  we  supposed  the  number  of  instrumented  modes 
to  be  so  large  that  sums  over  modes  could  be  replaced  by  integrals,  hold 
unchanged  in  the  wider  framework  of  this  section.  When  the  number  of 
modes  is  very  large,  their  statistical  properties  are  the  same,  whether 
they  are  plane  waves  or  belong  to  any  other  orthogonal  family. 
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X CONCLUDING  REMARKS 


Wo  conclude  this  paper  with  some  remarks  concerning  two  questions 
that  are  still  open  lor  further  investigation.  One  unsolved  problem  is 
to  estimate  quantitatively  the  errors  that  arise  when  we  use  the  approxi- 
mate equations  of  Section  IX,  based  on  the  assumption  that  all  the  kernels 
U(x  - x , t - t ) and  Bt>(v,  x - x ) commute,  in  situations  where  these 
kernels  do  not  commute.  The  ideal  solution  to  this  problem  would  be  to 
solve  exactly  the  equations  (9.6)  through  (9.15),  which  are  valid  for 
non-commuting  kernels,  and  compare  the  results  with  the  approximate 
prescriptions  (9.37)  through  (9.17).  If  an  exact  solution  is  unattainable, 
the  practical  alternative  is  to  choose  commuting  kernels  []'  and  B^  , 
which  are  in  some  sense  close"  to  the  true  kernels  U and  B^,  and  define 
the  parameters  of  the  system  using  Eqs . (9.37)  through  (9.-17)  with  the 
approximate  kernels.  Then  the  question  becomes,  how  close  B and 
must  be  to  U and  B ^ , in  order  that  the  system  optimized  for  u'  and  B^ 
perform  well  in  the  real  situation  with  kernels  U and  B^.  This  question 
could  be  answered  by  calculation,  using  standard  methods  of  perturbation 
theory  with  U , B^  defining  the  unperturbed  system.  In  the  real  world 
the  kernels  U and  B ^ will  vary  from  night  to  night  or  from  minute  to 
minute,  and  will  never  be  exactly  known.  One  will  necessarily  design 
the  system  around  some  assumed  kernels  , B^  that  may  be  chosen  to  be 
commuting.  The  perturbat ion-theory  analysis  will  show  how  wide  a range 
of  atmospheric  and  optical  conditions  the  system  so  designed  should  be 
able  to  handle.  It  is  reassuring  to  find  that  the  most  dramatic  result 
of  the  approximate  treatment — namely,  the  conclusion  that  the  tremor 
kernel  F has  a finite  number  of  nonzero  ir>odes--is  also  true  in  the  exact 
theory . 
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If  an  image-improvement  system  of  the  type  here  proposed  ever  be- 
comes operational,  one  valuable  byproduct  of  its  operation  will  be  a mass 
of  precise  data  concerning  the  statistical  behavior  of  atmospheric  fluc- 
tuations in  space  and  time.  These  data  will  be  easily  extracted  from 
recordings  of  the  servo-signals  fed  by  the  system  into  the  circuits  con- 
trolling the  mirror  surface.  No  data  of  comparable  accuracy  and  com- 
pleteness are  at  present  available. 

The  second  topic  of  this  section  is  the  proper  general  definition 
of  the  quantity  appearing  in  the  fundamental  formula  (1.1)  which  is 
supposed  to  summarize  the  results  of  the  entire  theory.  We  called  T| 
the  "contrastiness"  of  the  object,  and  defined  it  by  Eq.  (5.41).  That 
definition  was  unsatisfactory  because  it  applied  only  to  the  special 
case  when  the  quantity  A(v)  was  independent  of  v.  We  here  propose  a 
somewhat  more  general  definition. 

When  the  number  of  modes  is  large,  we  may  suppose  that  each  normal 
mode  is  mainly  made  up  of  Fourier  components  with  wavenumbers  con- 
centrated about  some  mean  value  k . Then  Eq . (9.26)  implies 

o 

A-  = (2n)  ' h(k  • (10.1) 

op  OP 


So  we  may  define  a mode-dependent  coefficient  ^ by 

o 


2 -4 

A = 6 T1  >,  (JVA) 
o o 


(10.2) 


instead  of  Eq . (5.41),  and  then  Eqs . (9,27)  and  (10.1)  give 


l2  = (>/L),(A/; 
o 


fi D [ (y/L) ,k  ]} 2 [ I (y)]"1 
J 2 a o 


dy 


(10.3) 


We  write 


02(9,v) 


dq  0[q  ,9  + 


(v/q) ] 


(10.4) 
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’O  that  Eq.  (5.9)  becomes 


D (9,v)  = 0 (9,v)  + 0 (6,-v)  - 2 0 (9.0) 
^ 2.  2 2 


(10.5) 


and  Eq.  (5.12)  gives 


•> 

<:/\~) 


f 


= I <ye.o)  d9 


(10.6) 


The  autocorrelation  function  g(rp)  of  the  object  brightness  distribution 
is 


(v  - cp  ) = [|0(9  + cp)0(6  + 


)de  !/{/*[0(9)]2  cie} 


(10.7) 


To  obtain  a simple  expression  for  we  assume  that  the  integral  in 
Eq.  (10.3)  can  be  estimated  by  using  the  rough  approximation 

foJ(y  E),vj  02[(y/L),v'j  , 1^  (y)  _1  dy 

= (L  ) (PA)  u ‘ (v  - ) , (10 


Then  Eq , (10.3)  reduces  to 


a 


1 


(1/3)  u(‘k  ) + (1  '3)  u ( 2>  k ) 


(10.9) 


the  tunction  u being  normalized  so  that  g.(0)  = 1 by  virtue  of  Eq . (10.7) 
Equation  (10.9)  is  not  i.xact  but  gives  a qualitatively  correct  picture. 
It  shows  ^ to  be  the  fraction  of  the  incident  light  contained  in  struc- 
tures with  an  angular  scale  smaller  than  (\k  ).  If  we  assume  the  object 

a 

autocorrelation  function  to  have  the  special  form 
J2  2 

= Un*U^  + (1  “ ) U (u)  , (10,10) 

with  describing  a very  narrow  and  n a very  wide  structure,  then 


81 


Eq . (10.9)  gives 
definition  of  . 


■n 


lor  all  a.  and  Eq . (10.2)  reduces  to  the  old 


lor  practical  estimates  of  system  performance,  one  may  u.<e  Eq . (1.1) 
2 

with  defined  by  averaging  Eq . (10.9)  over  the  n instrumented  modes 
a— namely , 


= 1 - n Z [ ( I 3)a  ().k  ) - (l/3)u.(2'*k  )J 


(10.11) 


When  the  number  oi  modes  Is  large,  the  sum  may  be  replaced  by  an  integral, 


and  we  obtain 


.2  2 -1  C 

- 1 - <s  v » J 


1“  Ok)  - ^ (2-  k ) dk 


(10.12) 


with  the  integral  taken  over  the  circle  k < V,  and  V defined  by  Eq . 

(a, 32).  Equation  (10,12)  states  that  is  roughly  equal  to  the  fraction 

ol  incident  light  belonging  to  structures  with  angular  scale  smaller  th^n 
( \),  which  by  Eq . (5,3  1)  is  .just  the  scale  of  the  uncorrected  optical 
seeing . The  practical  definition  of  " is  thus  Eq.  (10.12)  with  „(v) 
given  by  Eq . (10.7).  II  Eq . (10,12)  is  not  accurate  enough,  then  one 
should  go  back  to  the  exact  formula  (9.17),  with  g given  by  (9.12)  and 
defined  for  each  a separately  by  Eqs . (10.2)  and  (10.9). 
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Appendix  A 
PROOF  OF  THEOREM 


Let  the  input  a(t)  be  Fixed  while  averages  are  taken  over  photon 
statistics  only.  We  first  compute  the  probability  p(t)  per  unit  time 
for  a photon  to  arrive  at  time  t.  Equation  (2.2)  gives 

p(t)  = A + B <s(t)>  . (A-l ) 

P 


while  Ecps . (2.1)  and  (2.7)  give 


<s(t))^  = a (t ) + fsa  - t'  ) [p(t')  - A]  dt' 


(A-2 ) 


Therefore 


-/b 


p(t)  - f BN ( t - t'  ) p(t'  ) dt  = [1  - BN(O) ]A  + Ba(t)  , (A-3) 


from  which  the  Fourier  transform 


p('Ju)  = 2nA6(x)  + [1  - BN(uj)  Ba(ui) 


(A-  1) 


is  determined  explicitly. 

We  next  repeat  this  analysis  for  the  probability  p(t,u)  per  unit 
time  for  two  distinct  photons  to  arrive  at  the  times  t,  u.  Suppose  '.hot 
t > u.  Then  Eq . (2.2)  gives 

p(t,u)  = Ap(u)  + Bp (u ) (s(t)>  , (A-5) 

p,u 


where  the  average  is  taken  only  over  photon  histories  in  which  a photon 
arrived  at  time  u.  Equations  (2.4)  and  (2.7)  give 
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p(u)  (s(t)>  = p(u)  a (t ) + X ( t - u) 

P ,u 


+ ^*N(t  ~ * ) P(t/  ,u ) - Ap(u)  dt' 


Therefore 


/" 


p(t,u)  - Ids (i  - t' ) p(t'  ,u ) dt' 


= { Ba ( t ) + BX(t  - u)  + A[ 1 - BN ( 0 ) J } p (u  ) 


(A- 6) 


(A-7 ) 


We  write 


q(t  ,u)  = p(t  ,u)  - p(t ) p(u) 


(A-8) 


Then  Eqs,  (A-3)  and  (A-7)  imply 


(|(t,u)  - |BN(t  - t )<|(t  ,u  )dt  = BN  ( t - u)p(u) 


(A-9) 


We  cannot  solve  Eq . (A-9)  immediately  since  it  holds  only  for  t > u . 
Instead  we  apply  to  both  sides  the  operation 


■S' 


1 - |BN(u  - n ) du 


and  so  obtain 


q(t 


BN(  t - t ' ) <|  ( l , u ) dt ' 


, u ) - J BN 
- JdS(u  - ll')q(t,ll  ) du ' 


BN(u  - u )q(t  ,u  )di/  +11  B N (u  - it'  )N(t  - t'  )q(t  , i/  ) 


dt  dll'  = BN(t  - u)p(u)  -J  B~N(u  - n'  )N(t  - u'  )p(u'  )du' 

(A- 10) 

This  holds  lor  t u , since  t u implies  t u'  inside  the  integrals. 
But  the  left  side  of  Eq . (A-10)  is  symrne*  ric  in  t,  u , and  ,^o  is  the 
second  term  on  the  rip;ht . To  make  the  right  side  symmetric  we  need  only 
add  the  term 
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BN (n  - t)  p(t) 


( A-  1 1 ) 


which  is  zero  lor  t > u.  Therefore  Eq . (A- 10)  with  the  additional  term 

holds  lor  all  t,  u.  We  can  now  take  the  Fourier  transform  and  obtain  the 
simple  result 

<1  (x , x ) = {[1  - BN(uu)]_1[l  - BN (u/  ) J ~ 1 — 1 } p(x  + x'  ) 

( A— 12) 

The  last  step  in  the  proof  of  Eq . (2.10)  is  to  compute  the  quantity 

J(t,u)  ==  <s(t)  s(u)>  - <s(t)>  (s  (u)> 

P P P 


(b(t)  b(u)>  - <b(t))  (b(u )) 

P P P 


Equations  (2.1)  and  (A-8)  imply 

J(t,u)  =^|*N(t  ~ * )N(u  - u ) dt  (lu/ 

q(t'  ,u'  ) + 5 (t'  - u'  ) p(u'  ) ] 

Taking  the  Fourier  transform  and  using  Eq . (A-12),  we  deduce 


(A- 13 ) 


(A-l  1) 


JU.x')  = N(x ) N(x')  1 - BN(aj)  _1[1  - BN(x')]'1  p(x  + J ) 

(A- 15 ) 

Now  Eq.  (A-l)  gives 


<s(x)>  = fl  - BN(x)  _1  a (x) 

P 


( A- 1 6 ) 


Therefore  Eq-  (A- 13)  and  (A- 15)  imply 
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(s(x)  s(u/))  = [l  - BN(x)  ] 1[l  - BN^)] 

P 


iv  >v 


[a(xl  a(x'  ) + N(x)  N(x'  ) p(x  + x'  ) ] 


(A-17 ) 


Wc  now  take  averages  over  the  random  variable  a(t)  using  Eqs.  (2.1)  and 
(A-l).  We  have  then 


(s(x)  s(x7  ))  = 2n&  (x  + x'  ) 1 1 - BN(x) 


[U(x)  + A N(x)  ] 


(A-18) 


Since 


D = (2r)~2JJ(s(u)  s(x7  ))  dx  dx7  , 


(A-19 ) 


we  have  proved  Eq . (2,10). 


We  next  turn  to  Eqs.  (2.11)  through  (2.14).  The  function 


<P(x)  = [1  - BN (x) ] 


-1 


(A-20) 


must  be  regular  in  the  upper  half-plane  if  the  servosystem  it  to  be 

stable.  Let  cp  (x)  be  the  inverse  of  the  expression  on  the  right  side  of 
o 

Eq . (2.11).  This  is  also  analytic  in  the  upper  half-plane  and  has 
boundary  values  for  real  x sati  fying 

cpo(x)  = [1  + IU(x)  ]”  . (/-21) 

To  prove  Eq.  (2.11)  we  must  show  that  cp(x)  = c?  (w)  for  the  minimum  D. 

o 

We  write 


Cp  (x)  = 1 - f(x)  , (A-22 ) 

o 


[V(x)/cpQ(x)  ] = 1 + g(x) 


(A-23 ) 
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so  that  both  1 and  g arc  analytic  in  the  upper  hall-plane  and  tend  to 
zero  at  least  as  last  as  it  1 at  infinity.  Manipulation  of  Eq . (2.10) 
us  inn  E(|s.  ( A-  2 0 ) through  (A-23)  gives 

D = (1/2  I)  Jd'ju{2  Ref  (x)  + 2 Re  ' f (u))g  (uu)  ] + g(ui)  2}  (A-21) 

-2 

Tile  term  in  Re(fg)  vanishes  since  1 Ox)  g(x)  is  analytic  and  of  order  x 
at  infinity.  Therefore  D is  a minimum  when  g(x)  = 0,  and  Eq.  (2,11)  is 
proved . 

To  determine  the  minimum  value  of  0,  we  use  the  fact  that  the  lunc- 

t i on 

f (x ) + log  1 - 1 (x)  (A-25) 


is  also  analytic  and  ol  order  x at  inlinity.  Therefore 


D = ( 1/  2.ul ) J dx  f-2  Re  log[l  - 1 (x) ]}  . (A-26) 

Using  Eqs.  (A-22)  and  (A-21)  we  deduce  the  first  half  of  Eq . (2.11). 

The  second  half  of  Eq . (2.11)  follows  from  Eq . (2.11)  by  the  same  argu- 
ment. Finally,  Eqs . (2.11)  and  (2.15)  imply 


D < (1  2 I) 


/ 


-2 

logLl  + IMx  dx 


which  is  identical  with  Eq . (2.16), 


( A-27 ) 
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Appendix  D 


PROOF  OF  EQUATION  (2.30) 
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Appendix  B 


PROOF  OF  EQUATION  (2.30) 


The  function 


f(x)  = X X(1  + x)  1ok(1  + X) 

.1 


I 


(d\/>. ) Ll  - (1  - ><1  + Hx) 


-1 


(B-l) 


is  concave  for  positive  x.  For  any  function  tf(x) , we  define  an  averag- 
ing process  by 

Av  ^ (ui)  ] = (r.Y)"1  J*iU(x)[l  + IU(x)  J 1 k(x)  dx  , (B-2) 

ttith  Y given  by  Eq . (2.29).  The  concavity  of  x implies 

Av { 1 [ g ( x ) J } * f £Av[k<uu)  ’ . (B'3) 

Take  g(x)  = IU(x)  in  Eq.  (B-3).  The  left  side  is  (2ID  Y)  by  Eq.  (2.11). 
The  right  side  is  f[(z)  -1  , with  /.  defined  by  Eq . (2,31).  The  in- 
equality (2.30)  is  thus  identical  with  Eq . (B-3). 
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Appendix  C 


PROOI  01  EQUATION  (2.20) 
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Appendix  C 


PROOi  OF  EQUATION  (2.20) 


W'°  ar°  t0  proVe  Ef'-  <2‘20>  Y replaced  by  or  Y^  defined  by 

ElN  (2'J2)  °r  (2*33)*  1,1  the  case  of  Y]t  Eq.  (2.20)  follows  immediately 

iV°n  Eq*  (2,1,)  and  th°  ^istence  of  the  upper  bound  (2.31).  To  deduce 


the  result  for 


Yo • we  uso  Cauchy's  inequality 


jf[V(x)]1/-clx  I ^ [J( a + b x2)  dxj  j J(a  + 
= (ab)  |a^* t'(<ju)  dx  + b f lT(x)  x2d 


) U(U)>  clou  I 


valid  for  all  positive  a,  b.  Taking  the  minima 
deduce 


. (C-l) 

m on  the  right  side,  we 


JiU(x)]1 


U.a,  u say,  Y,  * V . So  the  result  for  Y,  is  a.  „ proved. 
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. Appendix  D 

CALCULATION  OF  PHOTON -PHOTON  CORRELATIONS 
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Appendix  D 

CALCULATION  OF  PHOTON- PHOTON  CORRELATIONS 


The  calculations  in  this  appendix  are  closely  parallel  to  those  of 
Appendix  A.  The  similarity  is  obscured  by  the  multiplicity  of  variables 
and  integrations  in  expressions  such  as  Eqs . (3.15)  and  (4.7).  The 
essential  first  step  is  therefore  to  introduce  a compact  set  of  notations 
so  that  the  logical  structure  of  the  calculation  becomes  clear. 

By  X we  denote  the  space  and  time  coordinates  (x.t^)  of  a Point  on 
the  mirror.  By  Y we  denote  the  coordinates  (y,t^)  of  a point  oi  the 
image.  A function  of  one  point  X or  Y we  call  a vector.  Vectors  are 

denoted  by 


Ai  * 0(V 


A2  = a(X2)  , °t‘ 


(D-l) 


S = <s(X  )> 

1 1 P 


(n-2) 


s - <I<Yi»P 


(D-.3) 


I = I (Y  ) 
01  o 1 


(D-4) 


A function  oi  two  points  X or  Y is  called  a bivector.  Bivectors  are 


S.2  = <SV  S(X2»p 

»12  ' <S‘Y  I<V2)>„ 

■12  = <i<yi>  ,,Y2)>p 


1 

C 
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J 

I 


(D-8 ) 


A = 6 (V  - y ) 6(t  - t ) < I ( V )> 

12  1 2 VI  Y2  1 1 


J = S - S S 
12  12  1 2 


D-9  ) 


Z = M -SI 
12  12  12 


(D-10) 


K = I - I I 
12  12  12 


(D- 11) 


the  last  equation  being  short-hand  for  Eq . (3.3).  Finally  we  have  two 
kernels  B and  N defined  by 

B ( Y , X ) = J(q2/2nL)2dq  Jdx  ' [c(x  ' . t^,)  - cfx.t^) 

2Re{  M (q ,x  - x ' )exp t-iq (x  - x')  • y L } 6 (ty  - ty)  , 

(D- 12  ) 

N(X,Y)  ~ffd*  dt'  N(x.x',y,tx  - t') 


:(x',t')[6(t'  - t ) - Ht  - ty>] 


(D-13) 


We  abbreviate  these  further  by  writing 


B S = fdX  B(Y  ,X)  <s(X)> 

1 1 J 1 P 

R S = fdX  B( Y ,X)  <s(X)  s (X  )) 

1 12  J 1 2 p 


N M = f dY  N(X  .Y)  (s (X  ) I (Y) ) 

2 12  J 2’  1 p 


(D-14) 


(D-15) 


(D-16) 


and  so  on.  The  kernel  B describes  the  optical  coupling  from  mirror  to 
imate,  while  N describes  the  feedback  coupling  from  image  to  mirror. 
Until  the  very  end  of  the  calculation  we  take  the  atmospheric  input 
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a(X)  to  be  given  and  average  only  over  the  photon  statistics.  The  vector 
equations  are 


I 

1 


+ B S 
1 1 


(D-17) 


S 

1 


= A + N I 
1 1 


(D-18) 


these  being  respectively  the  linearized  form  of  Eqs . (3.2)  and  (3.8). 
Because  the  subtracted  long-term  average  term  i n Eq . (D-13)  satisfies 
Eq . (3.11),  we  have 


N I =0 
1 01 


(D-19) 


and  therefore  Eqs.  (D-17)  and  (D-18)  give 

S1  = Ai  + • (D-20) 

Equation  (D-20)  becomes  identical  with  Eq . (3.14)  after  averaging  over  the 
tremor  amplitudes  c(x,t)  and  dropping  the  <|r  term.  The  bivector  equations 
are 


I 

12 


I I + 
01  2 


B M 
1 12 


(D-21 ) 


S 

12 


S A 
1 2 


+ N M 
2 12 


(D-22  ) 


A I + 
1 2 


N 1 + N A 

1 12  1 12 


(D-23  ) 


Those  hold  only  for  t > t^.  For  example,  Eq . (D-21)  states  that  the 

image  at  time  t^  is  related  by  the  kernel  B to  the  mirror  at  time  t , 

irrespective  of  the  state  of  the  image  at  a previous  time  t . The  re- 

2 

lation  would  bo  untrue  for  t < t , since  then  the  image  at  t would  be 

\ 2.  2 
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disturbed  by  feedback  from  the  events  at  t . The  other  essentially  new 
feature  in  the  bivector  equations  is  the  last  term  of  Eq . (D-23).  Equa- 
tion (D-23)  states  that  the  mirror  at  time  t ^ is  related  by  the  feedback 

kernel  N to  the  image  at  all  earlier  times  t , if  one  also  knows  the 

Y 

state  oi  the  image  at  another  previous  time  t . The  term  N I describes 

2 112 

the  feedback  from  all  times  t / t . The  term  N A is  the  additional 

* ^ 1 12 

correlation  between  mirror  at  ^ and  image  at  t^  produced  by  direct 

feedback  from  t to  t . 

2 1 

The  bivector  equations  become  even  simpler  when  expressed  in  terms 
of  the  reduced  bi vectors  (D-9)  through  (D-ll) — namely, 


K 


12 


B Z 
1 12 


(D-2d) 


N Z 
• 12 


(D-25 ) 


12 


= N (K  + 
1 12 


A ) 

12 


(D-26) 


still  with  the  restriction  t > It  is  permissible  to  substitute 

from  Eq . (D-26)  into  Eq . (D-21)  oi  Eq , (D-25)  without  violating  the  time 

sequence,  but  it  is  not  permissible  to  substitute  Eq.  (D-21)  into  Eq. 

(D-26).  We  thus  obtain  a closed  equation  for  K onlv. 

12  * ’ 


K = B N (K  + A ) 
12  1 1 12  12 


(D-27 ) 


with  J to  be  derived  from 
12 


12 


= N N (K  + A ) 
1 2 12  12 


(D-28) 


Equation  (D-28)  is  symmetric  between  the  points  and  X^ , and  therefore 
holds  regardless  of  the  time-order.  To  obtain  a symmetric  equation  for 
wc  operate  on  Eq.  (D-27)  with  the  kernel  (1  - B^N^)  which  commutes 


2 04 


with  B^N^.  This  gives 


(1  - B N ) ( 1 - BN  ) K = B N A - B N B N A 
11  2 2 12  11  12  11221 


(D-29) 


‘till  ior  t^  > t^  only.  Now  all  terms  in  Eq  . (D-29)  are  symmetric  except 

H.^.A  . We  add  to  the  right  side  the'  term  B_  N<  A which  is  zero  for 
•1112  2212 

t.  > t<>1  Thc  resuit  i-s  symmetric  and  must  therefore  hold  iur  all  t ,t 
12  12 
We  have  thus  proved 


(1  - BN  ) (1  - BN  ) (K  + A ) = A 

11  2 2 12  12  12, 


( D-  3 0 ) 


Now  using  Eq.  (D-28)  and  the  identity 


(1  - N x B x ) Nx  = N’x  (1  - B i N x ) , 


(D-31 ) 


we  deduce 


(1  - N B ) (1  - N B ) J = N N A 

11  2 2 12  1 2 12 


(D-32) 


Combining  this  with  Eqs . (D-9)  and  (D-20)  , we  obtain 


(1  - N B ) (1  - N B )S  = A A +NNA 

11  2 2 12  12  1 2 12 


(D-33 ) 


In  the  whole  calculation  up  to  this  point  it  was  essential  to  refrain 
from  averaging  over  the  tremor  amplitudes  c(x,t)  that  occur  in  the 
kernels  B and  N . The  products  BN  that  appear  in  Eq . (D-30)  are  in  the 
wrong  order  for  averaging,  since  they  contain  tremor  amplitudes  referring 
to  different  times  and  so  Eq . (3.6)  cannot  be  used.  Only  in  the  final 
result  (D-33)  the  products  appear  in  the  order  NB,  We  can  now  do  the 
tremor  averaging  by  Eq . (3.6).  We  drop  the  subtracted  ('-term  from  N 
since  it  becomes  negligible  after  the  tremor  averaging.  We  have  then 
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nb  = r 


(D-34 ) 


with  F cefincd  by  Eq . (3.15),  and  Eq . (D-33)  has  the  solution 


S = Q Q [A  A + N N A ] 
12  12  12  1 2 12 


(D-35) 


with  the  kernel  Q defined  by  Eq . (3.20).  Tremor  averaging  can  be  done 

in  the  product  N N A since  A makes  the  times  t and  t equal. 

' 1 2 12  12  12 

To  evaluate  D according  to  Eq . (1.5),  we  need  the  quantity 


< [s(x  ,t ) - s(x  ,t ) |2> 

1 2 p 


S - 2 S + S 
11  12  22 


- (Q  - Q ) (Q  - Q ) A A + N N A 1 

*1  x2/  l 2 j 2 12 


(D-36) 


allowing  ourselves  a slight  abuse  of  notation  in  the  last  line.  Now 
let  denote  the  constant  vector 


C(Xi)  = 1 


(D-37) 


A constant  phase  shift  produces  no  effect  on  the  image;  thus 


r C = B C =0  , (D-38) 

11  11 


as  can  be  verified  immediately  from  Eq.  (D-12).  Equation  (D-38)  implies 


Q C = C , (D-39 ) 

11  1 


and  therefore 


QC  - Q C = C - C = 0 . (D--10) 

^1121  1 2 
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Therefore  we  may  add  inside  the  square  bracket  in  Eq . (D-3j)  the  terms 


/,2  ,2\  1 

/ 2 

2 \ 

(A1  + Ao  ) = - “ 

A C + C 

A 1 

\ 1 2/  2 

\ 1 2 

1 2) 

the  added  terms  Riving  zero  when  operated  on  by  the  kernel  (Q  - Q ) 
(Qj  ~ Q,^).  We  have  then 

< [«  Cx  , t ) - s(x  t)  2) 

1 2 p 


(Q1  " V(<*,  - vf-  \ (A1  - Ao)2  * *’  * * 1 

1 “ 1 2^21  2'  1 2 12j 


(D- 42) 


We  finally  substitute  Eq . (D-42)  into  Eq . (1.5),  expand  the  abbreviated 
notations,  and  perform  the  average  over  atmospheric  ftatistics  using 
Eqs . (3.21),  (D-8),  and  (D-]7).  The  result  is  Eq . (4,7), 
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Appendix  E 

TABLE  01  COEFFICIENTS  c (3 ) , klc),  1(5) 
APPEARING  IN  EQUATIONS  (5.38),  (5.17)  AND  (5.19) 

l 


i 


. 


I 

f 


ion 
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Appendix  E 


The 

1 (s  ) is 


TABI.E  OF  COEFFICIENTS  e(£),  k O ) , TO) 
APPEARING  IN  EQUATIONS  (5.38),  (5.17)  AND  (5.19) 


2 

e 

c(S ) 

k(S) 

1 (?) 

0 

0 

0 

0.5 

0.5 

0.2 

0.0370 

1.2851 

0.6061 

0.598  1 

0.1 

0 . 0690 

2.6151 

0.7255 

0.6118 

O.G 

0 . 0968 

3.9505 

0.8532 

0.6632 

0.8 

0.1212 

5.21  17 

0.9808 

0.6688 

1.0 

0.1 129 

6.2832 

1.0929 

0.6598 

1.2 

0.1622 

6,9777 

1 .1619 

0.6350 

1.4 

0.1795 

7.0502 

1.1591 

0,5900 

1.6 

0.1951 

6.1821 

1 .0200 

0.5153 

1.8 

0.2093 

3 .9817 

0.6692 

0.3867 

u 

0.2222 

0 

0 

0 

maximum  value  of  k(5)  is  1.1717  at  S = 1.30.  The  maximum  of 
I.G689  at  p = 0.78. 
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Appendix  F 


PROOF  OF  EQUATIONS  (7.2-1),  (7.25),  AND  (7.26) 
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Appendix  F 


V 


PROOF  01  EQUATION’S  (7.2  1),  (7.25),  AND  ,7.20) 

The  proof  is  divided  into  two  parts.  First,  we  assume  that  L exists 
satisfying  Eq . (7.25),  and  prove  Eqs . (7.21)  and  (7.26).  Second,  we 
prove  the  existence  and  uniqueness  of  L.  The  first  part  of  the  proof  is 
similar  to  the  proof  of  Eqs.  (2.11)  and  (2.11)  in  Appendix  A.  Onlv  the 
details  are  more  delicate  because  we  have  here  to  deal  with  non-commuting 
kernels.  It  is  no  accident  that  the  second  equality  (2.11)  has  pre- 
cisely the  some  structure  as  Eq.  (7.27). 

Given  that  L exists,  we  define  two  causal  kernels  G and  H by 


G = (1  + LT)  I, 

H = (1  - KT) ~ 1 (L  + K)  T 

Then 

(1  - NT)  1 = (1  +■  H)  (1  - GT) 
and  Eq . (7.25)  becomes 


(1  - GT)  (V  + T_1)  (1  - TG  ) = T 1 


(F-l) 
(F-2  ) 


(F-3 ) 


(F-d ) 


Now  I)  is  given  by  Eq . (7.16),  with  V replacing  (-1  2 U)  according  to 
Eq.  (7.18).  Substituting  from  Eqs.  (F-3)  and  (F-l)  into  (7.16),  we  find 
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(F-5 ) 


D , = (S  )_1  Tr{ C [ ( 1 + H)  T_1  (1  + H ) 

S X 

*“  *f*  “1  "t"  1 

+ (1  - KT)~A  (KTK  - T~  ) (1  - TK  ) ]} 

= (Si)_1  TRfc;ilT”1H  + G r G+  + HG  + G H^~ ]} 

The  last  two  terms  i n (F-5)  give  zero  trace,  since  the  convolution  of  two 
causal  kernels  is  a kernel  that  is  continuous  and  therefore  vanishes  at 
equal  times.  The  term  quadratic  in  11  is  non-negative  since  C and  T are 
positive-definite.  Therefore  D attains  its  minimum  value  when  11=0, 

s 

and  Eq . (7.21)  is  proved.  The  minimum  is 

D = (S  Tr  C(G  + G ) " . (F-6) 

S : 

By  Eq.  (F-l),  the  difference 

L - G = LTG  (F-7 ) 

is  a convolution  of  two  causal  kernels  and  therefore  has  zero  trace. 
Hence,  Eq . (F-G)  implies  Eq . (7.26),  and  the  lirst  part  of  the  proof  is 
complete . 

’■Ve  next  prove  t'.c  uniqueness  of  L.  If  two  causal  kernels  L , Lt) 
satisfy  Eq . (7.25),  then  by  (7,29), 

-1  "f-  f -1 

(1  + I.yT>  (1  + Lj  T)  = (1  + L^T)  (1  + L^T)  . (F-8) 

But  the  lei t side  of  Eq . (F-8)  is  causal,  and  the  right  side  is  anti- 
causal.  Therefore  both  sides  must  equal  1,  and  the  uniqueness  is  proved. 
To  prove  the  existence  of  L,  we  follow  a simple  modification  of  the 
argument  used  by  Newton  and  Jost  (1955)  to  solve  the  multichannel  in- 
verse scattering  problem.  The  Newton-Jost  analysis  is  a generalization 
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to  noncommuting  potentials  oi  the  original  inverse  scattering  theory  of 
Geliand  and  Levitan  (1951).  We  consider  the  equation 


+ 

G (t)  + 


V(t 


4- 

t ')  TG  (t  ')  dt 7 = V ( t ' ) 


(F-9 ) 


as  an  integral  equation  for  the  kernel  G (x,x',t)  on  the  half-space 
t < 0.  This  is  an  equation  of  Fredholm  type  [see  Riesz  and  Sz.-Nagy 
(1955),  pages  161  to  190].  The  kernel  V(x,x',t  - t'),  with  x and  x' 
restricted  to  a finite  mirror,  is  completely  continuous.  Therefore  the 
Fredholm  alternative  holds;  either  Eq.  (F-9)  has  a solution,  or  the 
corresponding  homogeneous  equation  with  zero  on  the  right  side  has  a 
nontrivial  solution.  Bui  any  solution  f(t)  of  the  homogeneous  equation 
would  satisfy 


T 

f (t) 


T 1 ( t ) dt  + 


T 

f 


(t)T  V (t  - t 


)Ti 


1 ')  dt  dt  ' = 0 , 

(F-10) 


which  is  impossible  since  both  T and  V are  positive-definite.  So  the 

-f- 

existence  of  G (t)  satisfying  Eq . (F-9)  for  t < 0 is  proved.  If  we 

f* 

extend  the  definition  by  writing  G (t)  =0  for  t > 0,  then  G is  an 
anticausal  kernel.  Since  Eq . (F-9)  holds  for  t < 0,  the  kernel 


L = V - G - VTG 


(F- 11) 


is  causal.  We  define  L by  Eq . (F-ll),  which  may  also  be  written 

1 + VT  = (1  + LT)  (1  - G T)"1  . (F— 1 2 ) 

The  adjoint  of  Eq . (F-12)  implies 


1 + VT 


-1 

(1  - GT) 


+ 

(1  + L T) 


(F-13) 
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B>  the  same  argument  that  we  used  in  proving  uniqueness,  Eqs.  (F-12) 
and  (F-13)  imply 


t 


(1  - GT)  (1  + LT)  = (1  + L T)  (1  - G T)  = 1 


(F-14 ) 


Equations  (F-12)  and  (F-14)  together  imply  (7.29),  showing  that  L defined 
b>  (1-9)  and  (1-11)  gives  the  desired  solution  of  (7.25).  Also,  Eq . 
(F-ll)  shows  that  G defined  by  (F-9)  is  the  adjoint  of  G defined  by 
(1-1).  This  completes  the  second  part  of  the  proof. 


It  is  of  interest  to  observe  that  the  integral  equations  that  de- 
termine the  optimum  feedback  kernel  — namely,  Eqs.  (7.25)  and  (F-9)  — have 
the  same  formal  structure  as  the  nonlinear  and  linear  integral  equations 
in  the  Gelf and-Levi tan  (1951)  theory  of  the  inverse  scattering  problem. 
The  nonlinear  equation  is  Eq . (I)  on  page  319,  and  the  linear  equation 
is  Eq.  (II)  on  page  321  of  the  Gel fand-Levi tan  paper.  The  correspondence 
between  the  Gelfand-Levitan  kernels  and  ours  is  the  following.  Spectral 


function  f - atmospheric  kernel  V.  Transformation  kernel  K - -L,  and 


transformation  kernel  K -*  -G.  Spa'*?- coordinate  x ->  time-coordinate  t. 

An  even  closer  correspondence  exists  between  our  equations  and  those  of 
the  Marchenko  (1955)  formulation  of  the  inverse  scattering  problem. 

Our  Eqs.  (7.25)  and  (F-9)  correspond  to  Marchenko’s  Eqs . (II)  and  (III) 
wi  th 


f 


V , 13  — L a nd 


■0 


The  kernel  T has  no  analog  and  is  replaced  by  unity  in  the  Gelland- 
Levi tan-Marchenko  theory. 


It  is  plausible  that  an  analogy  should  exist  between  image  improve- 
ment and  the  inverse  scattering  problem,  since  in  both  problems  the  aim 
of  the  theory  is  to  determine  the  structure  of  a scattering  medium  by 
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i 


observing  its  effects  on  the  waves  that  pass  through  it.  However,  the 
preciseness  with  which  this  analogy  is  reflected  in  the  structure  of  our 
equations  came  as  a surprise.  The  logical  basis  of  the  analogy  remains 
to  be  explored  in  detail. 
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